Dataset statistics
| Number of variables | 39 |
|---|---|
| Number of observations | 119143 |
| Missing cells | 204813 |
| Missing cells (%) | 4.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 36.4 MiB |
| Average record size in memory | 320.0 B |
Variable types
| Text | 11 |
|---|---|
| Categorical | 5 |
| DateTime | 8 |
| Numeric | 15 |
price is highly overall correlated with payment_value and 1 other fields | High correlation |
payment_value is highly overall correlated with price | High correlation |
product_weight_g is highly overall correlated with price and 3 other fields | High correlation |
product_length_cm is highly overall correlated with product_weight_g and 1 other fields | High correlation |
product_height_cm is highly overall correlated with product_weight_g | High correlation |
product_width_cm is highly overall correlated with product_weight_g and 1 other fields | High correlation |
customer_zip_code_prefix is highly overall correlated with customer_state | High correlation |
seller_zip_code_prefix is highly overall correlated with seller_state | High correlation |
customer_state is highly overall correlated with customer_zip_code_prefix | High correlation |
seller_state is highly overall correlated with seller_zip_code_prefix | High correlation |
order_status is highly imbalanced (91.6%) | Imbalance |
payment_type is highly imbalanced (52.6%) | Imbalance |
seller_state is highly imbalanced (63.3%) | Imbalance |
order_delivered_carrier_date has 2086 (1.8%) missing values | Missing |
order_delivered_customer_date has 3421 (2.9%) missing values | Missing |
review_comment_title has 105154 (88.3%) missing values | Missing |
review_comment_message has 68898 (57.8%) missing values | Missing |
product_category_name has 2542 (2.1%) missing values | Missing |
product_name_lenght has 2542 (2.1%) missing values | Missing |
product_description_lenght has 2542 (2.1%) missing values | Missing |
product_photos_qty has 2542 (2.1%) missing values | Missing |
Reproduction
| Analysis started | 2023-07-25 00:21:08.636415 |
|---|---|
| Analysis finished | 2023-07-25 00:21:54.117148 |
| Duration | 45.48 seconds |
| Software version | ydata-profiling vv4.2.0 |
| Download configuration | config.json |
order_id
Text
| Distinct | 99441 |
|---|---|
| Distinct (%) | 83.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3812576 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 86494 ? |
|---|---|
| Unique (%) | 72.6% |
Sample
| 1st row | e481f51cbdc54678b7cc49136f2d6af7 |
|---|---|
| 2nd row | e481f51cbdc54678b7cc49136f2d6af7 |
| 3rd row | e481f51cbdc54678b7cc49136f2d6af7 |
| 4th row | 128e10d95713541c87cd1a2e48201934 |
| 5th row | 0e7e841ddf8f8f2de2bad69267ecfbcf |
| Value | Count | Frequency (%) |
| 895ab968e7bb0d5659d16cd74cd1650c | 63 | 0.1% |
| fedcd9f7ccdc8cba3a18defedd1a5547 | 38 | < 0.1% |
| fa65dad1b0e818e3ccc5cb0e39231352 | 29 | < 0.1% |
| ccf804e764ed5650cd8759557269dc13 | 26 | < 0.1% |
| c6492b842ac190db807c15aff21a7dd6 | 24 | < 0.1% |
| a3725dfe487d359b5be08cac48b64ec5 | 24 | < 0.1% |
| 465c2e1bee4561cb39e0db8c5993aafc | 24 | < 0.1% |
| 6d58638e32674bebee793a47ac4cbadc | 24 | < 0.1% |
| 68986e4324f6a21481df4e6e89abcf01 | 24 | < 0.1% |
| 285c2e15bebd4ac83635ccc563dc71f4 | 22 | < 0.1% |
| Other values (99431) | 118845 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 239371 | 6.3% |
| b | 239344 | 6.3% |
| 6 | 239250 | 6.3% |
| e | 238956 | 6.3% |
| 3 | 238724 | 6.3% |
| c | 238563 | 6.3% |
| 8 | 238518 | 6.3% |
| 7 | 238501 | 6.3% |
| 1 | 238422 | 6.3% |
| a | 238162 | 6.2% |
| Other values (6) | 1424765 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2382278 | |
| Lowercase Letter | 1430298 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 239371 | |
| 6 | 239250 | |
| 3 | 238724 | |
| 8 | 238518 | |
| 7 | 238501 | |
| 1 | 238422 | |
| 2 | 237912 | |
| 9 | 237674 | |
| 0 | 237062 | |
| 5 | 236844 |
Lowercase Letter
| Value | Count | Frequency (%) |
| b | 239344 | |
| e | 238956 | |
| c | 238563 | |
| a | 238162 | |
| f | 237877 | |
| d | 237396 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2382278 | |
| Latin | 1430298 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 239371 | |
| 6 | 239250 | |
| 3 | 238724 | |
| 8 | 238518 | |
| 7 | 238501 | |
| 1 | 238422 | |
| 2 | 237912 | |
| 9 | 237674 | |
| 0 | 237062 | |
| 5 | 236844 |
Latin
| Value | Count | Frequency (%) |
| b | 239344 | |
| e | 238956 | |
| c | 238563 | |
| a | 238162 | |
| f | 237877 | |
| d | 237396 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3812576 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 239371 | 6.3% |
| b | 239344 | 6.3% |
| 6 | 239250 | 6.3% |
| e | 238956 | 6.3% |
| 3 | 238724 | 6.3% |
| c | 238563 | 6.3% |
| 8 | 238518 | 6.3% |
| 7 | 238501 | 6.3% |
| 1 | 238422 | 6.3% |
| a | 238162 | 6.2% |
| Other values (6) | 1424765 |
customer_id
Text
| Distinct | 99441 |
|---|---|
| Distinct (%) | 83.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3812576 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 86494 ? |
|---|---|
| Unique (%) | 72.6% |
Sample
| 1st row | 9ef432eb6251297304e76186b10a928d |
|---|---|
| 2nd row | 9ef432eb6251297304e76186b10a928d |
| 3rd row | 9ef432eb6251297304e76186b10a928d |
| 4th row | a20e8105f23924cd00833fd87daa0831 |
| 5th row | 26c7ac168e1433912a51b924fbd34d34 |
| Value | Count | Frequency (%) |
| 270c23a11d024a44c896d1894b261a83 | 63 | 0.1% |
| 13aa59158da63ba0e93ec6ac2c07aacb | 38 | < 0.1% |
| 9af2372a1e49340278e7c1ef8d749f34 | 29 | < 0.1% |
| 92cd3ec6e2d643d4ebd0e3d6238f69e2 | 26 | < 0.1% |
| 6ee2f17e3b6c33d6a9557f280edd2925 | 24 | < 0.1% |
| d22f25a9fadfb1abbc2e29395b1239f4 | 24 | < 0.1% |
| 63b964e79dee32a3587651701a2b8dbf | 24 | < 0.1% |
| 2ba91e12e5e4c9f56b82b86d9031d329 | 24 | < 0.1% |
| 86cc80fef09f7f39df4b0dbce48e81cb | 24 | < 0.1% |
| b246eeed30b362c09d867b9e598bee51 | 22 | < 0.1% |
| Other values (99431) | 118845 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 239014 | 6.3% |
| 2 | 238989 | 6.3% |
| 5 | 238748 | 6.3% |
| c | 238647 | 6.3% |
| 6 | 238555 | 6.3% |
| 1 | 238489 | 6.3% |
| 8 | 238361 | 6.3% |
| d | 238357 | 6.3% |
| 7 | 238270 | 6.2% |
| a | 238267 | 6.2% |
| Other values (6) | 1426879 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2381876 | |
| Lowercase Letter | 1430700 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 238989 | |
| 5 | 238748 | |
| 6 | 238555 | |
| 1 | 238489 | |
| 8 | 238361 | |
| 7 | 238270 | |
| 3 | 238161 | |
| 9 | 238003 | |
| 4 | 237276 | |
| 0 | 237024 |
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 239014 | |
| c | 238647 | |
| d | 238357 | |
| a | 238267 | |
| e | 238226 | |
| b | 238189 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2381876 | |
| Latin | 1430700 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 238989 | |
| 5 | 238748 | |
| 6 | 238555 | |
| 1 | 238489 | |
| 8 | 238361 | |
| 7 | 238270 | |
| 3 | 238161 | |
| 9 | 238003 | |
| 4 | 237276 | |
| 0 | 237024 |
Latin
| Value | Count | Frequency (%) |
| f | 239014 | |
| c | 238647 | |
| d | 238357 | |
| a | 238267 | |
| e | 238226 | |
| b | 238189 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3812576 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 239014 | 6.3% |
| 2 | 238989 | 6.3% |
| 5 | 238748 | 6.3% |
| c | 238647 | 6.3% |
| 6 | 238555 | 6.3% |
| 1 | 238489 | 6.3% |
| 8 | 238361 | 6.3% |
| d | 238357 | 6.3% |
| 7 | 238270 | 6.2% |
| a | 238267 | 6.2% |
| Other values (6) | 1426879 |
order_status
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| delivered | |
|---|---|
| shipped | 1256 |
| canceled | 750 |
| unavailable | 652 |
| invoiced | 378 |
| Other values (3) | 384 |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 8.9834401 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1070314 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | delivered |
|---|---|
| 2nd row | delivered |
| 3rd row | delivered |
| 4th row | delivered |
| 5th row | delivered |
Common Values
| Value | Count | Frequency (%) |
| delivered | 115723 | |
| shipped | 1256 | 1.1% |
| canceled | 750 | 0.6% |
| unavailable | 652 | 0.5% |
| invoiced | 378 | 0.3% |
| processing | 376 | 0.3% |
| created | 5 | < 0.1% |
| approved | 3 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| delivered | 115723 | |
| shipped | 1256 | 1.1% |
| canceled | 750 | 0.6% |
| unavailable | 652 | 0.5% |
| invoiced | 378 | 0.3% |
| processing | 376 | 0.3% |
| created | 5 | < 0.1% |
| approved | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 351344 | |
| d | 233838 | |
| i | 118763 | 11.1% |
| l | 117777 | 11.0% |
| v | 116756 | 10.9% |
| r | 116107 | 10.8% |
| p | 2894 | 0.3% |
| a | 2714 | 0.3% |
| c | 2259 | 0.2% |
| n | 2156 | 0.2% |
| Other values (7) | 5706 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1070314 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 351344 | |
| d | 233838 | |
| i | 118763 | 11.1% |
| l | 117777 | 11.0% |
| v | 116756 | 10.9% |
| r | 116107 | 10.8% |
| p | 2894 | 0.3% |
| a | 2714 | 0.3% |
| c | 2259 | 0.2% |
| n | 2156 | 0.2% |
| Other values (7) | 5706 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1070314 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 351344 | |
| d | 233838 | |
| i | 118763 | 11.1% |
| l | 117777 | 11.0% |
| v | 116756 | 10.9% |
| r | 116107 | 10.8% |
| p | 2894 | 0.3% |
| a | 2714 | 0.3% |
| c | 2259 | 0.2% |
| n | 2156 | 0.2% |
| Other values (7) | 5706 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1070314 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 351344 | |
| d | 233838 | |
| i | 118763 | 11.1% |
| l | 117777 | 11.0% |
| v | 116756 | 10.9% |
| r | 116107 | 10.8% |
| p | 2894 | 0.3% |
| a | 2714 | 0.3% |
| c | 2259 | 0.2% |
| n | 2156 | 0.2% |
| Other values (7) | 5706 | 0.5% |
| Distinct | 98875 |
|---|---|
| Distinct (%) | 83.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| Minimum | 2016-09-04 21:15:19 |
|---|---|
| Maximum | 2018-10-17 17:30:18 |
| Distinct | 90733 |
|---|---|
| Distinct (%) | 76.3% |
| Missing | 177 |
| Missing (%) | 0.1% |
| Memory size | 1.8 MiB |
| Minimum | 2016-09-15 12:16:38 |
|---|---|
| Maximum | 2018-09-03 17:40:06 |
| Distinct | 81018 |
|---|---|
| Distinct (%) | 69.2% |
| Missing | 2086 |
| Missing (%) | 1.8% |
| Memory size | 1.8 MiB |
| Minimum | 2016-10-08 10:34:01 |
|---|---|
| Maximum | 2018-09-11 19:48:28 |
| Distinct | 95664 |
|---|---|
| Distinct (%) | 82.7% |
| Missing | 3421 |
| Missing (%) | 2.9% |
| Memory size | 1.8 MiB |
| Minimum | 2016-10-11 13:46:32 |
|---|---|
| Maximum | 2018-10-17 13:22:46 |
| Distinct | 459 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| Minimum | 2016-09-30 00:00:00 |
|---|---|
| Maximum | 2018-11-12 00:00:00 |
order_item_id
Real number (ℝ)
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 833 |
| Missing (%) | 0.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.196543 |
| Minimum | 1 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 21 |
| Range | 20 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6994889 |
|---|---|
| Coefficient of variation (CV) | 0.58459154 |
| Kurtosis | 103.35482 |
| Mean | 1.196543 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.5517266 |
| Sum | 141563 |
| Variance | 0.48928472 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 103645 | |
| 2 | 10317 | 8.7% |
| 3 | 2396 | 2.0% |
| 4 | 995 | 0.8% |
| 5 | 472 | 0.4% |
| 6 | 265 | 0.2% |
| 7 | 61 | 0.1% |
| 8 | 37 | < 0.1% |
| 9 | 29 | < 0.1% |
| 10 | 26 | < 0.1% |
| Other values (11) | 67 | 0.1% |
| (Missing) | 833 | 0.7% |
| Value | Count | Frequency (%) |
| 1 | 103645 | |
| 2 | 10317 | 8.7% |
| 3 | 2396 | 2.0% |
| 4 | 995 | 0.8% |
| 5 | 472 | 0.4% |
| 6 | 265 | 0.2% |
| 7 | 61 | 0.1% |
| 8 | 37 | < 0.1% |
| 9 | 29 | < 0.1% |
| 10 | 26 | < 0.1% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 3 | < 0.1% |
| 19 | 3 | < 0.1% |
| 18 | 3 | < 0.1% |
| 17 | 3 | < 0.1% |
| 16 | 3 | < 0.1% |
| 15 | 5 | < 0.1% |
| 14 | 7 | |
| 13 | 8 | |
| 12 | 13 |
product_id
Text
| Distinct | 32951 |
|---|---|
| Distinct (%) | 27.9% |
| Missing | 833 |
| Missing (%) | 0.7% |
| Memory size | 1.8 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3785920 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 17345 ? |
|---|---|
| Unique (%) | 14.7% |
Sample
| 1st row | 87285b34884572647811a353c7ac498a |
|---|---|
| 2nd row | 87285b34884572647811a353c7ac498a |
| 3rd row | 87285b34884572647811a353c7ac498a |
| 4th row | 87285b34884572647811a353c7ac498a |
| 5th row | 87285b34884572647811a353c7ac498a |
| Value | Count | Frequency (%) |
| aca2eb7d00ea1a7b8ebd4e68314663af | 536 | 0.5% |
| 99a4788cb24856965c36a24e339b6058 | 528 | 0.4% |
| 422879e10f46682990de24d770e7f83d | 508 | 0.4% |
| 389d119b48cf3043d311335e499d9c6b | 406 | 0.3% |
| 368c6c730842d78016ad823897a372db | 398 | 0.3% |
| 53759a2ecddad2bb87a079a1f1519f73 | 391 | 0.3% |
| d1c427060a0f73f6b889a5c7c61f2ac4 | 357 | 0.3% |
| 53b36df67ebb7c41585e8d54d6772e08 | 327 | 0.3% |
| 154e7e31ebfa092203795c972e5804a6 | 295 | 0.2% |
| 3dd2a17168ec895c781a9191c1e95ad7 | 278 | 0.2% |
| Other values (32941) | 114286 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 243128 | 6.4% |
| 9 | 241092 | 6.4% |
| e | 238897 | 6.3% |
| 8 | 238246 | 6.3% |
| 7 | 238157 | 6.3% |
| 4 | 237487 | 6.3% |
| a | 237289 | 6.3% |
| c | 236394 | 6.2% |
| 0 | 236277 | 6.2% |
| 2 | 236110 | 6.2% |
| Other values (6) | 1402843 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2375835 | |
| Lowercase Letter | 1410085 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 243128 | |
| 9 | 241092 | |
| 8 | 238246 | |
| 7 | 238157 | |
| 4 | 237487 | |
| 0 | 236277 | |
| 2 | 236110 | |
| 6 | 235751 | |
| 5 | 235615 | |
| 1 | 233972 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 238897 | |
| a | 237289 | |
| c | 236394 | |
| b | 235053 | |
| d | 232751 | |
| f | 229701 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2375835 | |
| Latin | 1410085 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 243128 | |
| 9 | 241092 | |
| 8 | 238246 | |
| 7 | 238157 | |
| 4 | 237487 | |
| 0 | 236277 | |
| 2 | 236110 | |
| 6 | 235751 | |
| 5 | 235615 | |
| 1 | 233972 |
Latin
| Value | Count | Frequency (%) |
| e | 238897 | |
| a | 237289 | |
| c | 236394 | |
| b | 235053 | |
| d | 232751 | |
| f | 229701 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3785920 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 243128 | 6.4% |
| 9 | 241092 | 6.4% |
| e | 238897 | 6.3% |
| 8 | 238246 | 6.3% |
| 7 | 238157 | 6.3% |
| 4 | 237487 | 6.3% |
| a | 237289 | 6.3% |
| c | 236394 | 6.2% |
| 0 | 236277 | 6.2% |
| 2 | 236110 | 6.2% |
| Other values (6) | 1402843 |
seller_id
Text
| Distinct | 3095 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 833 |
| Missing (%) | 0.7% |
| Memory size | 1.8 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3785920 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 487 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 3504c0cb71d7fa48d967e0e4c94d59d9 |
|---|---|
| 2nd row | 3504c0cb71d7fa48d967e0e4c94d59d9 |
| 3rd row | 3504c0cb71d7fa48d967e0e4c94d59d9 |
| 4th row | 3504c0cb71d7fa48d967e0e4c94d59d9 |
| 5th row | 3504c0cb71d7fa48d967e0e4c94d59d9 |
| Value | Count | Frequency (%) |
| 4a3ca9315b744ce9f8e9374361493884 | 2155 | 1.8% |
| 6560211a19b47992c3666cc44a7e94c0 | 2130 | 1.8% |
| 1f50f920176fa81dab994f9023523100 | 2017 | 1.7% |
| cc419e0650a3c5ba77189a1882b7556a | 1893 | 1.6% |
| da8622b14eb17ae2831f4ac5b9dab84a | 1662 | 1.4% |
| 955fee9216a65b617aa5c0531780ce60 | 1530 | 1.3% |
| 1025f0e2d44d7041d6cf58b6550e0bfa | 1477 | 1.2% |
| 7c67e1448b00f6e969d365cea6b010ab | 1463 | 1.2% |
| 7a67c85e85bb2ce8582c35f2203ad736 | 1245 | 1.1% |
| ea8482cd71df3c1969d7b9473ff13abc | 1240 | 1.0% |
| Other values (3085) | 101498 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 256833 | 6.8% |
| c | 250006 | 6.6% |
| 4 | 248481 | 6.6% |
| 6 | 243514 | 6.4% |
| 0 | 242843 | 6.4% |
| a | 241350 | 6.4% |
| b | 240801 | 6.4% |
| 3 | 240746 | 6.4% |
| 9 | 235027 | 6.2% |
| 2 | 233698 | 6.2% |
| Other values (6) | 1352621 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2394603 | |
| Lowercase Letter | 1391317 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 256833 | |
| 4 | 248481 | |
| 6 | 243514 | |
| 0 | 242843 | |
| 3 | 240746 | |
| 9 | 235027 | |
| 2 | 233698 | |
| 8 | 231747 | |
| 5 | 231055 | |
| 7 | 230659 |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 250006 | |
| a | 241350 | |
| b | 240801 | |
| e | 222598 | |
| f | 219169 | |
| d | 217393 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2394603 | |
| Latin | 1391317 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 256833 | |
| 4 | 248481 | |
| 6 | 243514 | |
| 0 | 242843 | |
| 3 | 240746 | |
| 9 | 235027 | |
| 2 | 233698 | |
| 8 | 231747 | |
| 5 | 231055 | |
| 7 | 230659 |
Latin
| Value | Count | Frequency (%) |
| c | 250006 | |
| a | 241350 | |
| b | 240801 | |
| e | 222598 | |
| f | 219169 | |
| d | 217393 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3785920 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 256833 | 6.8% |
| c | 250006 | 6.6% |
| 4 | 248481 | 6.6% |
| 6 | 243514 | 6.4% |
| 0 | 242843 | 6.4% |
| a | 241350 | 6.4% |
| b | 240801 | 6.4% |
| 3 | 240746 | 6.4% |
| 9 | 235027 | 6.2% |
| 2 | 233698 | 6.2% |
| Other values (6) | 1352621 |
| Distinct | 93318 |
|---|---|
| Distinct (%) | 78.9% |
| Missing | 833 |
| Missing (%) | 0.7% |
| Memory size | 1.8 MiB |
| Minimum | 2016-09-19 00:15:34 |
|---|---|
| Maximum | 2020-04-09 22:35:08 |
price
Real number (ℝ)
| Distinct | 5968 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 833 |
| Missing (%) | 0.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 120.6466 |
| Minimum | 0.85 |
|---|---|
| Maximum | 6735 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 0.85 |
|---|---|
| 5-th percentile | 17 |
| Q1 | 39.9 |
| median | 74.9 |
| Q3 | 134.9 |
| 95-th percentile | 349.9 |
| Maximum | 6735 |
| Range | 6734.15 |
| Interquartile range (IQR) | 95 |
Descriptive statistics
| Standard deviation | 184.10969 |
|---|---|
| Coefficient of variation (CV) | 1.5260247 |
| Kurtosis | 119.15494 |
| Mean | 120.6466 |
| Median Absolute Deviation (MAD) | 42 |
| Skewness | 7.8925735 |
| Sum | 14273700 |
| Variance | 33896.378 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 59.9 | 2619 | 2.2% |
| 69.9 | 2113 | 1.8% |
| 49.9 | 2051 | 1.7% |
| 89.9 | 1644 | 1.4% |
| 99.9 | 1526 | 1.3% |
| 39.9 | 1403 | 1.2% |
| 29.9 | 1387 | 1.2% |
| 19.9 | 1284 | 1.1% |
| 79.9 | 1282 | 1.1% |
| 29.99 | 1228 | 1.0% |
| Other values (5958) | 101773 |
| Value | Count | Frequency (%) |
| 0.85 | 3 | < 0.1% |
| 1.2 | 20 | |
| 2.2 | 2 | < 0.1% |
| 2.29 | 1 | < 0.1% |
| 2.9 | 1 | < 0.1% |
| 2.99 | 1 | < 0.1% |
| 3 | 2 | < 0.1% |
| 3.06 | 3 | < 0.1% |
| 3.49 | 3 | < 0.1% |
| 3.5 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 6735 | 1 | |
| 6729 | 1 | |
| 6499 | 1 | |
| 4799 | 1 | |
| 4690 | 1 | |
| 4590 | 1 | |
| 4399.87 | 1 | |
| 4099.99 | 1 | |
| 4059 | 1 | |
| 3999.9 | 1 |
freight_value
Real number (ℝ)
| Distinct | 6999 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 833 |
| Missing (%) | 0.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.032387 |
| Minimum | 0 |
|---|---|
| Maximum | 409.68 |
| Zeros | 390 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7.78 |
| Q1 | 13.08 |
| median | 16.28 |
| Q3 | 21.18 |
| 95-th percentile | 45.3 |
| Maximum | 409.68 |
| Range | 409.68 |
| Interquartile range (IQR) | 8.1 |
Descriptive statistics
| Standard deviation | 15.83685 |
|---|---|
| Coefficient of variation (CV) | 0.79056234 |
| Kurtosis | 57.635327 |
| Mean | 20.032387 |
| Median Absolute Deviation (MAD) | 3.63 |
| Skewness | 5.5433839 |
| Sum | 2370031.6 |
| Variance | 250.80583 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15.1 | 3861 | 3.2% |
| 7.78 | 2355 | 2.0% |
| 11.85 | 1999 | 1.7% |
| 14.1 | 1992 | 1.7% |
| 18.23 | 1632 | 1.4% |
| 7.39 | 1573 | 1.3% |
| 16.11 | 1211 | 1.0% |
| 15.23 | 1064 | 0.9% |
| 8.72 | 970 | 0.8% |
| 16.79 | 930 | 0.8% |
| Other values (6989) | 100723 |
| Value | Count | Frequency (%) |
| 0 | 390 | |
| 0.01 | 4 | < 0.1% |
| 0.02 | 3 | < 0.1% |
| 0.03 | 14 | < 0.1% |
| 0.04 | 4 | < 0.1% |
| 0.05 | 9 | < 0.1% |
| 0.06 | 13 | < 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.08 | 12 | < 0.1% |
| 0.09 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 409.68 | 1 | |
| 375.28 | 2 | |
| 339.59 | 1 | |
| 338.3 | 1 | |
| 322.1 | 1 | |
| 321.88 | 1 | |
| 321.46 | 1 | |
| 317.47 | 1 | |
| 314.4 | 1 | |
| 314.02 | 1 |
payment_sequential
Real number (ℝ)
| Distinct | 29 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0947373 |
| Minimum | 1 |
|---|---|
| Maximum | 29 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 29 |
| Range | 28 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.73014099 |
|---|---|
| Coefficient of variation (CV) | 0.66695545 |
| Kurtosis | 342.28301 |
| Mean | 1.0947373 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 15.775506 |
| Sum | 130427 |
| Variance | 0.53310587 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 113999 | |
| 2 | 3415 | 2.9% |
| 3 | 658 | 0.6% |
| 4 | 322 | 0.3% |
| 5 | 194 | 0.2% |
| 6 | 136 | 0.1% |
| 7 | 94 | 0.1% |
| 8 | 63 | 0.1% |
| 9 | 51 | < 0.1% |
| 10 | 42 | < 0.1% |
| Other values (19) | 166 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 113999 | |
| 2 | 3415 | 2.9% |
| 3 | 658 | 0.6% |
| 4 | 322 | 0.3% |
| 5 | 194 | 0.2% |
| 6 | 136 | 0.1% |
| 7 | 94 | 0.1% |
| 8 | 63 | 0.1% |
| 9 | 51 | < 0.1% |
| 10 | 42 | < 0.1% |
| Value | Count | Frequency (%) |
| 29 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 26 | 2 | < 0.1% |
| 25 | 2 | < 0.1% |
| 24 | 2 | < 0.1% |
| 23 | 2 | < 0.1% |
| 22 | 3 | |
| 21 | 6 | |
| 20 | 6 |
payment_type
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 1.8 MiB |
| credit_card | |
|---|---|
| boleto | |
| voucher | 6465 |
| debit_card | 1706 |
| not_defined | 3 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 9.7954004 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1167024 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | credit_card |
|---|---|
| 2nd row | voucher |
| 3rd row | voucher |
| 4th row | credit_card |
| 5th row | credit_card |
Common Values
| Value | Count | Frequency (%) |
| credit_card | 87776 | |
| boleto | 23190 | 19.5% |
| voucher | 6465 | 5.4% |
| debit_card | 1706 | 1.4% |
| not_defined | 3 | < 0.1% |
| (Missing) | 3 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| credit_card | 87776 | |
| boleto | 23190 | 19.5% |
| voucher | 6465 | 5.4% |
| debit_card | 1706 | 1.4% |
| not_defined | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 183723 | |
| r | 183723 | |
| d | 178970 | |
| e | 119143 | |
| t | 112675 | |
| i | 89485 | |
| _ | 89485 | |
| a | 89482 | |
| o | 52848 | 4.5% |
| b | 24896 | 2.1% |
| Other values (6) | 42594 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1077539 | |
| Connector Punctuation | 89485 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 183723 | |
| r | 183723 | |
| d | 178970 | |
| e | 119143 | |
| t | 112675 | |
| i | 89485 | |
| a | 89482 | |
| o | 52848 | 4.9% |
| b | 24896 | 2.3% |
| l | 23190 | 2.2% |
| Other values (5) | 19404 | 1.8% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 89485 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1077539 | |
| Common | 89485 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 183723 | |
| r | 183723 | |
| d | 178970 | |
| e | 119143 | |
| t | 112675 | |
| i | 89485 | |
| a | 89482 | |
| o | 52848 | 4.9% |
| b | 24896 | 2.3% |
| l | 23190 | 2.2% |
| Other values (5) | 19404 | 1.8% |
Common
| Value | Count | Frequency (%) |
| _ | 89485 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1167024 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 183723 | |
| r | 183723 | |
| d | 178970 | |
| e | 119143 | |
| t | 112675 | |
| i | 89485 | |
| _ | 89485 | |
| a | 89482 | |
| o | 52848 | 4.5% |
| b | 24896 | 2.1% |
| Other values (6) | 42594 | 3.6% |
payment_installments
Real number (ℝ)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9412456 |
| Minimum | 0 |
|---|---|
| Maximum | 24 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 10 |
| Maximum | 24 |
| Range | 24 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.7778477 |
|---|---|
| Coefficient of variation (CV) | 0.94444604 |
| Kurtosis | 2.5065453 |
| Mean | 2.9412456 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.6198199 |
| Sum | 350420 |
| Variance | 7.7164381 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 59446 | |
| 2 | 13838 | 11.6% |
| 3 | 11889 | 10.0% |
| 4 | 8072 | 6.8% |
| 10 | 6976 | 5.9% |
| 5 | 6097 | 5.1% |
| 8 | 5120 | 4.3% |
| 6 | 4674 | 3.9% |
| 7 | 1848 | 1.6% |
| 9 | 739 | 0.6% |
| Other values (14) | 441 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 3 | < 0.1% |
| 1 | 59446 | |
| 2 | 13838 | 11.6% |
| 3 | 11889 | 10.0% |
| 4 | 8072 | 6.8% |
| 5 | 6097 | 5.1% |
| 6 | 4674 | 3.9% |
| 7 | 1848 | 1.6% |
| 8 | 5120 | 4.3% |
| 9 | 739 | 0.6% |
| Value | Count | Frequency (%) |
| 24 | 34 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 21 | 6 | < 0.1% |
| 20 | 21 | < 0.1% |
| 18 | 38 | |
| 17 | 8 | < 0.1% |
| 16 | 7 | < 0.1% |
| 15 | 93 | |
| 14 | 16 | < 0.1% |
payment_value
Real number (ℝ)
| Distinct | 29077 |
|---|---|
| Distinct (%) | 24.4% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 172.73514 |
| Minimum | 0 |
|---|---|
| Maximum | 13664.08 |
| Zeros | 9 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 27.1 |
| Q1 | 60.85 |
| median | 108.16 |
| Q3 | 189.24 |
| 95-th percentile | 515.93 |
| Maximum | 13664.08 |
| Range | 13664.08 |
| Interquartile range (IQR) | 128.39 |
Descriptive statistics
| Standard deviation | 267.77608 |
|---|---|
| Coefficient of variation (CV) | 1.550212 |
| Kurtosis | 500.3632 |
| Mean | 172.73514 |
| Median Absolute Deviation (MAD) | 56.64 |
| Skewness | 13.965989 |
| Sum | 20579664 |
| Variance | 71704.027 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 351 | 0.3% |
| 100 | 300 | 0.3% |
| 20 | 286 | 0.2% |
| 77.57 | 250 | 0.2% |
| 35 | 166 | 0.1% |
| 73.34 | 160 | 0.1% |
| 30 | 139 | 0.1% |
| 116.94 | 133 | 0.1% |
| 56.78 | 123 | 0.1% |
| 65 | 120 | 0.1% |
| Other values (29067) | 117112 |
| Value | Count | Frequency (%) |
| 0 | 9 | |
| 0.01 | 6 | |
| 0.03 | 2 | < 0.1% |
| 0.05 | 2 | < 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.08 | 2 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.1 | 3 | < 0.1% |
| 0.11 | 2 | < 0.1% |
| 0.13 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 13664.08 | 8 | |
| 7274.88 | 4 | |
| 6929.31 | 1 | < 0.1% |
| 6922.21 | 1 | < 0.1% |
| 6726.66 | 1 | < 0.1% |
| 6081.54 | 6 | |
| 4950.34 | 1 | < 0.1% |
| 4809.44 | 2 | < 0.1% |
| 4764.34 | 1 | < 0.1% |
| 4681.78 | 1 | < 0.1% |
review_id
Text
| Distinct | 98410 |
|---|---|
| Distinct (%) | 83.3% |
| Missing | 997 |
| Missing (%) | 0.8% |
| Memory size | 1.8 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3780672 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 85411 ? |
|---|---|
| Unique (%) | 72.3% |
Sample
| 1st row | a54f0611adc9ed256b57ede6b6eb5114 |
|---|---|
| 2nd row | a54f0611adc9ed256b57ede6b6eb5114 |
| 3rd row | a54f0611adc9ed256b57ede6b6eb5114 |
| 4th row | b46f1e34512b0f4c74a72398b03ca788 |
| 5th row | dc90f19c2806f1abba9e72ad3c350073 |
| Value | Count | Frequency (%) |
| eef5dbca8d37dfce6db7d7b16dd0525e | 63 | 0.1% |
| 7145a6f0d38ec713897856cbdcfcdb7f | 38 | < 0.1% |
| f28281373ab8815bafafe371218f02ce | 29 | < 0.1% |
| 8823bba1e3301fee652eb06de8ef9435 | 26 | < 0.1% |
| b0c2f8c122ebef9f77753f7d167cf634 | 24 | < 0.1% |
| b79b22bb50f78f1afe361661011fd892 | 24 | < 0.1% |
| cc074f1c33940c4f0dd904705f98e39e | 24 | < 0.1% |
| b5292206f96cd5d97359940203a0b510 | 24 | < 0.1% |
| 7e568736c98c553aea896a5dca746d5a | 22 | < 0.1% |
| 8fb71ed887db39231871ef3d1ba781cf | 21 | < 0.1% |
| Other values (98400) | 117851 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 237115 | 6.3% |
| 6 | 237048 | 6.3% |
| 5 | 236734 | 6.3% |
| b | 236598 | 6.3% |
| d | 236586 | 6.3% |
| 8 | 236562 | 6.3% |
| f | 236490 | 6.3% |
| 1 | 236390 | 6.3% |
| 0 | 236298 | 6.3% |
| 7 | 236091 | 6.2% |
| Other values (6) | 1414760 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2361788 | |
| Lowercase Letter | 1418884 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 237048 | |
| 5 | 236734 | |
| 8 | 236562 | |
| 1 | 236390 | |
| 0 | 236298 | |
| 7 | 236091 | |
| 2 | 235983 | |
| 9 | 235791 | |
| 3 | 235541 | |
| 4 | 235350 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 237115 | |
| b | 236598 | |
| d | 236586 | |
| f | 236490 | |
| e | 236068 | |
| c | 236027 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2361788 | |
| Latin | 1418884 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 237048 | |
| 5 | 236734 | |
| 8 | 236562 | |
| 1 | 236390 | |
| 0 | 236298 | |
| 7 | 236091 | |
| 2 | 235983 | |
| 9 | 235791 | |
| 3 | 235541 | |
| 4 | 235350 |
Latin
| Value | Count | Frequency (%) |
| a | 237115 | |
| b | 236598 | |
| d | 236586 | |
| f | 236490 | |
| e | 236068 | |
| c | 236027 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3780672 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 237115 | 6.3% |
| 6 | 237048 | 6.3% |
| 5 | 236734 | 6.3% |
| b | 236598 | 6.3% |
| d | 236586 | 6.3% |
| 8 | 236562 | 6.3% |
| f | 236490 | 6.3% |
| 1 | 236390 | 6.3% |
| 0 | 236298 | 6.3% |
| 7 | 236091 | 6.2% |
| Other values (6) | 1414760 |
review_score
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 997 |
| Missing (%) | 0.8% |
| Memory size | 1.8 MiB |
| 5.0 | |
|---|---|
| 4.0 | |
| 1.0 | |
| 3.0 | |
| 2.0 | 4162 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 354438 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4.0 |
|---|---|
| 2nd row | 4.0 |
| 3rd row | 4.0 |
| 4th row | 4.0 |
| 5th row | 5.0 |
Common Values
| Value | Count | Frequency (%) |
| 5.0 | 66343 | |
| 4.0 | 22319 | 18.7% |
| 1.0 | 15428 | 12.9% |
| 3.0 | 9894 | 8.3% |
| 2.0 | 4162 | 3.5% |
| (Missing) | 997 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 5.0 | 66343 | |
| 4.0 | 22319 | 18.9% |
| 1.0 | 15428 | 13.1% |
| 3.0 | 9894 | 8.4% |
| 2.0 | 4162 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 118146 | |
| 0 | 118146 | |
| 5 | 66343 | |
| 4 | 22319 | 6.3% |
| 1 | 15428 | 4.4% |
| 3 | 9894 | 2.8% |
| 2 | 4162 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 236292 | |
| Other Punctuation | 118146 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 118146 | |
| 5 | 66343 | |
| 4 | 22319 | 9.4% |
| 1 | 15428 | 6.5% |
| 3 | 9894 | 4.2% |
| 2 | 4162 | 1.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 118146 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 354438 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 118146 | |
| 0 | 118146 | |
| 5 | 66343 | |
| 4 | 22319 | 6.3% |
| 1 | 15428 | 4.4% |
| 3 | 9894 | 2.8% |
| 2 | 4162 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 354438 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 118146 | |
| 0 | 118146 | |
| 5 | 66343 | |
| 4 | 22319 | 6.3% |
| 1 | 15428 | 4.4% |
| 3 | 9894 | 2.8% |
| 2 | 4162 | 1.2% |
| Distinct | 4527 |
|---|---|
| Distinct (%) | 32.4% |
| Missing | 105154 |
| Missing (%) | 88.3% |
| Memory size | 1.8 MiB |
Length
| Max length | 26 |
|---|---|
| Median length | 20 |
| Mean length | 12.213525 |
| Min length | 1 |
Characters and Unicode
| Total characters | 170855 |
|---|---|
| Distinct characters | 125 |
| Distinct categories | 14 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 3095 ? |
|---|---|
| Unique (%) | 22.1% |
Sample
| 1st row | Muito boa a loja |
|---|---|
| 2nd row | super recomendo |
| 3rd row | Muito bom |
| 4th row | super recomendo |
| 5th row | super recomendo |
| Value | Count | Frequency (%) |
| recomendo | 2478 | 9.3% |
| produto | 1570 | 5.9% |
| bom | 1521 | 5.7% |
| muito | 1040 | 3.9% |
| super | 1039 | 3.9% |
| não | 937 | 3.5% |
| ótimo | 809 | 3.0% |
| excelente | 769 | 2.9% |
| entrega | 716 | 2.7% |
| recebi | 445 | 1.7% |
| Other values (2100) | 15445 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 21273 | 12.5% |
| e | 18299 | 10.7% |
| 15200 | 8.9% | |
| r | 9902 | 5.8% |
| t | 9433 | 5.5% |
| a | 9158 | 5.4% |
| m | 8444 | 4.9% |
| d | 8214 | 4.8% |
| i | 8084 | 4.7% |
| n | 7660 | 4.5% |
| Other values (115) | 55188 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 132750 | |
| Uppercase Letter | 18878 | 11.0% |
| Space Separator | 15200 | 8.9% |
| Other Punctuation | 2683 | 1.6% |
| Decimal Number | 1242 | 0.7% |
| Other Symbol | 48 | < 0.1% |
| Dash Punctuation | 19 | < 0.1% |
| Modifier Symbol | 13 | < 0.1% |
| Math Symbol | 8 | < 0.1% |
| Close Punctuation | 7 | < 0.1% |
| Other values (4) | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 21273 | |
| e | 18299 | |
| r | 9902 | 7.5% |
| t | 9433 | 7.1% |
| a | 9158 | 6.9% |
| m | 8444 | 6.4% |
| d | 8214 | 6.2% |
| i | 8084 | 6.1% |
| n | 7660 | 5.8% |
| c | 5955 | 4.5% |
| Other values (31) | 26328 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2533 | |
| R | 2048 | |
| O | 1675 | 8.9% |
| P | 1617 | 8.6% |
| M | 1448 | 7.7% |
| N | 1205 | 6.4% |
| S | 1046 | 5.5% |
| A | 989 | 5.2% |
| Ó | 933 | 4.9% |
| B | 896 | 4.7% |
| Other values (26) | 4488 |
Other Punctuation
| Value | Count | Frequency (%) |
| ! | 1208 | |
| . | 787 | |
| * | 404 | 15.1% |
| , | 152 | 5.7% |
| ? | 52 | 1.9% |
| / | 39 | 1.5% |
| % | 22 | 0.8% |
| : | 6 | 0.2% |
| ; | 4 | 0.1% |
| " | 3 | 0.1% |
| Other values (4) | 6 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 468 | |
| 1 | 413 | |
| 5 | 82 | 6.6% |
| 2 | 77 | 6.2% |
| 8 | 46 | 3.7% |
| 3 | 43 | 3.5% |
| 4 | 39 | 3.1% |
| 9 | 29 | 2.3% |
| 7 | 24 | 1.9% |
| 6 | 21 | 1.7% |
Other Symbol
| Value | Count | Frequency (%) |
| 👍 | 16 | |
| 😍 | 9 | |
| 👏 | 7 | |
| 🌟 | 6 | 12.5% |
| 💥 | 5 | 10.4% |
| 🚚 | 1 | 2.1% |
| 👎 | 1 | 2.1% |
| 😀 | 1 | 2.1% |
| 🔟 | 1 | 2.1% |
| 🤗 | 1 | 2.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 6 | |
| 🏻 | 3 | |
| 🏽 | 2 | 15.4% |
| 🏼 | 2 | 15.4% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 7 | |
| = | 1 | 12.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6 | |
| ] | 1 | 14.3% |
Space Separator
| Value | Count | Frequency (%) |
| 15200 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Other Letter
| Value | Count | Frequency (%) |
| ª | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 151629 | |
| Common | 19226 | 11.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 21273 | |
| e | 18299 | |
| r | 9902 | 6.5% |
| t | 9433 | 6.2% |
| a | 9158 | 6.0% |
| m | 8444 | 5.6% |
| d | 8214 | 5.4% |
| i | 8084 | 5.3% |
| n | 7660 | 5.1% |
| c | 5955 | 3.9% |
| Other values (68) | 45207 |
Common
| Value | Count | Frequency (%) |
| 15200 | ||
| ! | 1208 | 6.3% |
| . | 787 | 4.1% |
| 0 | 468 | 2.4% |
| 1 | 413 | 2.1% |
| * | 404 | 2.1% |
| , | 152 | 0.8% |
| 5 | 82 | 0.4% |
| 2 | 77 | 0.4% |
| ? | 52 | 0.3% |
| Other values (37) | 383 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 167210 | |
| None | 3635 | 2.1% |
| Emoticons | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 21273 | |
| e | 18299 | 10.9% |
| 15200 | 9.1% | |
| r | 9902 | 5.9% |
| t | 9433 | 5.6% |
| a | 9158 | 5.5% |
| m | 8444 | 5.0% |
| d | 8214 | 4.9% |
| i | 8084 | 4.8% |
| n | 7660 | 4.6% |
| Other values (75) | 51543 |
None
| Value | Count | Frequency (%) |
| ã | 1125 | |
| Ó | 933 | |
| á | 360 | 9.9% |
| ç | 347 | 9.5% |
| é | 257 | 7.1% |
| ó | 246 | 6.8% |
| Ã | 71 | 2.0% |
| í | 67 | 1.8% |
| ê | 47 | 1.3% |
| É | 30 | 0.8% |
| Other values (28) | 152 | 4.2% |
Emoticons
| Value | Count | Frequency (%) |
| 😍 | 9 | |
| 😀 | 1 | 10.0% |
| Distinct | 36159 |
|---|---|
| Distinct (%) | 72.0% |
| Missing | 68898 |
| Missing (%) | 57.8% |
| Memory size | 1.8 MiB |
Length
| Max length | 208 |
|---|---|
| Median length | 159 |
| Mean length | 70.656662 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3550144 |
|---|---|
| Distinct characters | 209 |
| Distinct categories | 15 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 29556 ? |
|---|---|
| Unique (%) | 58.8% |
Sample
| 1st row | Não testei o produto ainda, mas ele veio correto e em boas condições. Apenas a caixa que veio bem amassada e danificada, o que ficará chato, pois se trata de um presente. |
|---|---|
| 2nd row | Não testei o produto ainda, mas ele veio correto e em boas condições. Apenas a caixa que veio bem amassada e danificada, o que ficará chato, pois se trata de um presente. |
| 3rd row | Não testei o produto ainda, mas ele veio correto e em boas condições. Apenas a caixa que veio bem amassada e danificada, o que ficará chato, pois se trata de um presente. |
| 4th row | Deveriam embalar melhor o produto. A caixa veio toda amassada e vou dar de presente. |
| 5th row | Só achei ela pequena pra seis xícaras ,mais é um bom produto |
| Value | Count | Frequency (%) |
| o | 23016 | 3.8% |
| produto | 21207 | 3.5% |
| e | 20046 | 3.3% |
| a | 15246 | 2.5% |
| de | 14566 | 2.4% |
| não | 13436 | 2.2% |
| do | 13109 | 2.2% |
| que | 10560 | 1.7% |
| prazo | 9427 | 1.6% |
| muito | 9086 | 1.5% |
| Other values (19737) | 456676 |
Most occurring characters
| Value | Count | Frequency (%) |
| 562672 | ||
| o | 350271 | 9.9% |
| e | 340009 | 9.6% |
| a | 282074 | 7.9% |
| r | 200163 | 5.6% |
| i | 163758 | 4.6% |
| t | 161344 | 4.5% |
| d | 150134 | 4.2% |
| n | 138033 | 3.9% |
| s | 134085 | 3.8% |
| Other values (199) | 1067601 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2682795 | |
| Space Separator | 562672 | 15.8% |
| Uppercase Letter | 168680 | 4.8% |
| Other Punctuation | 96429 | 2.7% |
| Decimal Number | 22010 | 0.6% |
| Control | 14006 | 0.4% |
| Dash Punctuation | 984 | < 0.1% |
| Close Punctuation | 750 | < 0.1% |
| Open Punctuation | 731 | < 0.1% |
| Other Symbol | 703 | < 0.1% |
| Other values (5) | 384 | < 0.1% |
Most frequent character per category
Other Symbol
| Value | Count | Frequency (%) |
| 👏 | 241 | |
| 👍 | 88 | 12.5% |
| 😍 | 76 | 10.8% |
| ° | 27 | 3.8% |
| 😉 | 23 | 3.3% |
| 😘 | 20 | 2.8% |
| 😡 | 19 | 2.7% |
| 😆 | 19 | 2.7% |
| 👎 | 13 | 1.8% |
| 😁 | 13 | 1.8% |
| Other values (55) | 164 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 350271 | |
| e | 340009 | |
| a | 282074 | |
| r | 200163 | 7.5% |
| i | 163758 | 6.1% |
| t | 161344 | 6.0% |
| d | 150134 | 5.6% |
| n | 138033 | 5.1% |
| s | 134085 | 5.0% |
| m | 128515 | 4.8% |
| Other values (40) | 634409 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 19919 | |
| O | 18865 | |
| A | 17497 | |
| P | 12531 | 7.4% |
| R | 12209 | 7.2% |
| C | 9839 | 5.8% |
| N | 9688 | 5.7% |
| M | 9486 | 5.6% |
| S | 8362 | 5.0% |
| T | 7870 | 4.7% |
| Other values (31) | 42414 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 50682 | |
| , | 28154 | |
| ! | 12761 | 13.2% |
| / | 1881 | 2.0% |
| ? | 1630 | 1.7% |
| " | 440 | 0.5% |
| : | 307 | 0.3% |
| ; | 233 | 0.2% |
| % | 194 | 0.2% |
| * | 78 | 0.1% |
| Other values (5) | 69 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5074 | |
| 1 | 5056 | |
| 2 | 4230 | |
| 3 | 1967 | 8.9% |
| 4 | 1367 | 6.2% |
| 5 | 1246 | 5.7% |
| 8 | 946 | 4.3% |
| 6 | 905 | 4.1% |
| 7 | 753 | 3.4% |
| 9 | 466 | 2.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 88 | |
| = | 29 | 19.7% |
| | | 12 | 8.2% |
| < | 10 | 6.8% |
| ~ | 3 | 2.0% |
| × | 2 | 1.4% |
| > | 2 | 1.4% |
| ÷ | 1 | 0.7% |
Modifier Symbol
| Value | Count | Frequency (%) |
| 🏻 | 36 | |
| ´ | 26 | |
| 🏼 | 15 | |
| 🏽 | 13 | 12.5% |
| ^ | 8 | 7.7% |
| 🏾 | 4 | 3.8% |
| ` | 2 | 1.9% |
Control
| Value | Count | Frequency (%) |
| 6993 | ||
| 6993 | ||
| 20 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 747 | |
| ] | 3 | 0.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 725 | |
| [ | 6 | 0.8% |
Other Letter
| Value | Count | Frequency (%) |
| º | 26 | |
| ª | 19 |
Space Separator
| Value | Count | Frequency (%) |
| 562672 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 984 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 80 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2851520 | |
| Common | 698624 | 19.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 562672 | ||
| . | 50682 | 7.3% |
| , | 28154 | 4.0% |
| ! | 12761 | 1.8% |
| 6993 | 1.0% | |
| 6993 | 1.0% | |
| 0 | 5074 | 0.7% |
| 1 | 5056 | 0.7% |
| 2 | 4230 | 0.6% |
| 3 | 1967 | 0.3% |
| Other values (106) | 14042 | 2.0% |
Latin
| Value | Count | Frequency (%) |
| o | 350271 | |
| e | 340009 | |
| a | 282074 | 9.9% |
| r | 200163 | 7.0% |
| i | 163758 | 5.7% |
| t | 161344 | 5.7% |
| d | 150134 | 5.3% |
| n | 138033 | 4.8% |
| s | 134085 | 4.7% |
| m | 128515 | 4.5% |
| Other values (83) | 803134 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3485523 | |
| None | 64369 | 1.8% |
| Emoticons | 252 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 562672 | ||
| o | 350271 | 10.0% |
| e | 340009 | 9.8% |
| a | 282074 | 8.1% |
| r | 200163 | 5.7% |
| i | 163758 | 4.7% |
| t | 161344 | 4.6% |
| d | 150134 | 4.3% |
| n | 138033 | 4.0% |
| s | 134085 | 3.8% |
| Other values (85) | 1002980 |
None
| Value | Count | Frequency (%) |
| ã | 19200 | |
| é | 11589 | |
| á | 9179 | |
| ç | 7483 | 11.6% |
| ó | 6322 | 9.8% |
| ê | 1962 | 3.0% |
| í | 1794 | 2.8% |
| Ó | 1563 | 2.4% |
| õ | 948 | 1.5% |
| ú | 913 | 1.4% |
| Other values (73) | 3416 | 5.3% |
Emoticons
| Value | Count | Frequency (%) |
| 😍 | 76 | |
| 😉 | 23 | 9.1% |
| 😘 | 20 | 7.9% |
| 😡 | 19 | 7.5% |
| 😆 | 19 | 7.5% |
| 😁 | 13 | 5.2% |
| 😊 | 12 | 4.8% |
| 😀 | 8 | 3.2% |
| 😩 | 7 | 2.8% |
| 😃 | 6 | 2.4% |
| Other values (21) | 49 |
| Distinct | 636 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 997 |
| Missing (%) | 0.8% |
| Memory size | 1.8 MiB |
| Minimum | 2016-10-02 00:00:00 |
|---|---|
| Maximum | 2018-08-31 00:00:00 |
| Distinct | 98248 |
|---|---|
| Distinct (%) | 83.2% |
| Missing | 997 |
| Missing (%) | 0.8% |
| Memory size | 1.8 MiB |
| Minimum | 2016-10-07 18:32:28 |
|---|---|
| Maximum | 2018-10-29 12:27:35 |
| Distinct | 73 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2542 |
| Missing (%) | 2.1% |
| Memory size | 1.8 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 32 |
| Mean length | 14.876202 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1734580 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | utilidades_domesticas |
|---|---|
| 2nd row | utilidades_domesticas |
| 3rd row | utilidades_domesticas |
| 4th row | utilidades_domesticas |
| 5th row | utilidades_domesticas |
| Value | Count | Frequency (%) |
| cama_mesa_banho | 11988 | 10.3% |
| beleza_saude | 10032 | 8.6% |
| esporte_lazer | 9004 | 7.7% |
| moveis_decoracao | 8832 | 7.6% |
| informatica_acessorios | 8150 | 7.0% |
| utilidades_domesticas | 7380 | 6.3% |
| relogios_presentes | 6213 | 5.3% |
| telefonia | 4726 | 4.1% |
| ferramentas_jardim | 4590 | 3.9% |
| automotivo | 4400 | 3.8% |
| Other values (63) | 41286 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 210541 | |
| a | 208834 | |
| s | 172854 | |
| o | 171612 | |
| i | 115171 | 6.6% |
| r | 111455 | 6.4% |
| _ | 110493 | 6.4% |
| t | 83288 | 4.8% |
| c | 82435 | 4.8% |
| m | 78683 | 4.5% |
| Other values (18) | 389214 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1623786 | |
| Connector Punctuation | 110493 | 6.4% |
| Decimal Number | 301 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 210541 | |
| a | 208834 | |
| s | 172854 | |
| o | 171612 | |
| i | 115171 | 7.1% |
| r | 111455 | 6.9% |
| t | 83288 | 5.1% |
| c | 82435 | 5.1% |
| m | 78683 | 4.8% |
| n | 59158 | 3.6% |
| Other values (16) | 329755 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 110493 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 301 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1623786 | |
| Common | 110794 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 210541 | |
| a | 208834 | |
| s | 172854 | |
| o | 171612 | |
| i | 115171 | 7.1% |
| r | 111455 | 6.9% |
| t | 83288 | 5.1% |
| c | 82435 | 5.1% |
| m | 78683 | 4.8% |
| n | 59158 | 3.6% |
| Other values (16) | 329755 |
Common
| Value | Count | Frequency (%) |
| _ | 110493 | |
| 2 | 301 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1734580 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 210541 | |
| a | 208834 | |
| s | 172854 | |
| o | 171612 | |
| i | 115171 | 6.6% |
| r | 111455 | 6.4% |
| _ | 110493 | 6.4% |
| t | 83288 | 4.8% |
| c | 82435 | 4.8% |
| m | 78683 | 4.5% |
| Other values (18) | 389214 |
product_name_lenght
Real number (ℝ)
| Distinct | 66 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2542 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48.767498 |
| Minimum | 5 |
|---|---|
| Maximum | 76 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 29 |
| Q1 | 42 |
| median | 52 |
| Q3 | 57 |
| 95-th percentile | 60 |
| Maximum | 76 |
| Range | 71 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 10.03354 |
|---|---|
| Coefficient of variation (CV) | 0.20574236 |
| Kurtosis | 0.14950788 |
| Mean | 48.767498 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.90489402 |
| Sum | 5686339 |
| Variance | 100.67193 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 59 | 8679 | 7.3% |
| 60 | 8070 | 6.8% |
| 56 | 6847 | 5.7% |
| 58 | 6819 | 5.7% |
| 57 | 6302 | 5.3% |
| 55 | 5833 | 4.9% |
| 54 | 5529 | 4.6% |
| 53 | 4357 | 3.7% |
| 52 | 4328 | 3.6% |
| 49 | 3690 | 3.1% |
| Other values (56) | 56147 |
| Value | Count | Frequency (%) |
| 5 | 9 | < 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 2 | < 0.1% |
| 8 | 4 | < 0.1% |
| 9 | 15 | < 0.1% |
| 10 | 9 | < 0.1% |
| 11 | 11 | < 0.1% |
| 12 | 38 | |
| 13 | 26 | |
| 14 | 47 |
| Value | Count | Frequency (%) |
| 76 | 1 | < 0.1% |
| 72 | 9 | < 0.1% |
| 69 | 1 | < 0.1% |
| 68 | 1 | < 0.1% |
| 67 | 3 | < 0.1% |
| 66 | 1 | < 0.1% |
| 64 | 174 | 0.1% |
| 63 | 1350 | |
| 62 | 167 | 0.1% |
| 61 | 241 | 0.2% |
product_description_lenght
Real number (ℝ)
| Distinct | 2960 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 2542 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 785.96782 |
| Minimum | 4 |
|---|---|
| Maximum | 3992 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 160 |
| Q1 | 346 |
| median | 600 |
| Q3 | 983 |
| 95-th percentile | 2123 |
| Maximum | 3992 |
| Range | 3988 |
| Interquartile range (IQR) | 637 |
Descriptive statistics
| Standard deviation | 652.58412 |
|---|---|
| Coefficient of variation (CV) | 0.83029369 |
| Kurtosis | 4.9299321 |
| Mean | 785.96782 |
| Median Absolute Deviation (MAD) | 296 |
| Skewness | 2.0121562 |
| Sum | 91644634 |
| Variance | 425866.03 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 341 | 711 | 0.6% |
| 1893 | 667 | 0.6% |
| 348 | 648 | 0.5% |
| 903 | 594 | 0.5% |
| 492 | 594 | 0.5% |
| 245 | 587 | 0.5% |
| 366 | 537 | 0.5% |
| 236 | 516 | 0.4% |
| 340 | 487 | 0.4% |
| 919 | 442 | 0.4% |
| Other values (2950) | 110818 | |
| (Missing) | 2542 | 2.1% |
| Value | Count | Frequency (%) |
| 4 | 6 | |
| 8 | 2 | < 0.1% |
| 15 | 1 | < 0.1% |
| 20 | 7 | |
| 23 | 1 | < 0.1% |
| 26 | 2 | < 0.1% |
| 27 | 4 | |
| 28 | 2 | < 0.1% |
| 30 | 8 | |
| 31 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 3992 | 2 | < 0.1% |
| 3988 | 1 | < 0.1% |
| 3985 | 3 | |
| 3976 | 6 | |
| 3963 | 1 | < 0.1% |
| 3956 | 3 | |
| 3954 | 2 | < 0.1% |
| 3950 | 2 | < 0.1% |
| 3949 | 1 | < 0.1% |
| 3948 | 1 | < 0.1% |
product_photos_qty
Real number (ℝ)
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2542 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.2051612 |
| Minimum | 1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.7174519 |
|---|---|
| Coefficient of variation (CV) | 0.7788328 |
| Kurtosis | 4.8200793 |
| Mean | 2.2051612 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.9087497 |
| Sum | 257124 |
| Variance | 2.9496409 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 58957 | |
| 2 | 23054 | 19.3% |
| 3 | 12978 | 10.9% |
| 4 | 8863 | 7.4% |
| 5 | 5599 | 4.7% |
| 6 | 3945 | 3.3% |
| 7 | 1560 | 1.3% |
| 8 | 774 | 0.6% |
| 10 | 354 | 0.3% |
| 9 | 318 | 0.3% |
| Other values (9) | 199 | 0.2% |
| (Missing) | 2542 | 2.1% |
| Value | Count | Frequency (%) |
| 1 | 58957 | |
| 2 | 23054 | 19.3% |
| 3 | 12978 | 10.9% |
| 4 | 8863 | 7.4% |
| 5 | 5599 | 4.7% |
| 6 | 3945 | 3.3% |
| 7 | 1560 | 1.3% |
| 8 | 774 | 0.6% |
| 9 | 318 | 0.3% |
| 10 | 354 | 0.3% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 19 | 2 | < 0.1% |
| 18 | 4 | < 0.1% |
| 17 | 11 | < 0.1% |
| 15 | 12 | < 0.1% |
| 14 | 6 | < 0.1% |
| 13 | 30 | < 0.1% |
| 12 | 60 | 0.1% |
| 11 | 73 | 0.1% |
| 10 | 354 |
product_weight_g
Real number (ℝ)
| Distinct | 2204 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 853 |
| Missing (%) | 0.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2112.2507 |
| Minimum | 0 |
|---|---|
| Maximum | 40425 |
| Zeros | 8 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 125 |
| Q1 | 300 |
| median | 700 |
| Q3 | 1800 |
| 95-th percentile | 9850 |
| Maximum | 40425 |
| Range | 40425 |
| Interquartile range (IQR) | 1500 |
Descriptive statistics
| Standard deviation | 3786.6951 |
|---|---|
| Coefficient of variation (CV) | 1.7927299 |
| Kurtosis | 16.01826 |
| Mean | 2112.2507 |
| Median Absolute Deviation (MAD) | 500 |
| Skewness | 3.5830918 |
| Sum | 2.4985814 × 108 |
| Variance | 14339060 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 200 | 7093 | 6.0% |
| 150 | 5410 | 4.5% |
| 250 | 4741 | 4.0% |
| 300 | 4429 | 3.7% |
| 400 | 3787 | 3.2% |
| 100 | 3666 | 3.1% |
| 350 | 3291 | 2.8% |
| 500 | 2856 | 2.4% |
| 600 | 2838 | 2.4% |
| 700 | 2148 | 1.8% |
| Other values (2194) | 78031 |
| Value | Count | Frequency (%) |
| 0 | 8 | < 0.1% |
| 2 | 5 | < 0.1% |
| 25 | 3 | < 0.1% |
| 50 | 991 | |
| 53 | 2 | < 0.1% |
| 54 | 2 | < 0.1% |
| 55 | 2 | < 0.1% |
| 58 | 1 | < 0.1% |
| 60 | 9 | < 0.1% |
| 61 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 40425 | 3 | < 0.1% |
| 30000 | 303 | |
| 29800 | 1 | < 0.1% |
| 29750 | 1 | < 0.1% |
| 29700 | 4 | < 0.1% |
| 29600 | 5 | < 0.1% |
| 29500 | 2 | < 0.1% |
| 29250 | 1 | < 0.1% |
| 29150 | 1 | < 0.1% |
| 29100 | 1 | < 0.1% |
product_length_cm
Real number (ℝ)
| Distinct | 99 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 853 |
| Missing (%) | 0.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.265145 |
| Minimum | 7 |
|---|---|
| Maximum | 105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 18 |
| median | 25 |
| Q3 | 38 |
| 95-th percentile | 62 |
| Maximum | 105 |
| Range | 98 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 16.189367 |
|---|---|
| Coefficient of variation (CV) | 0.53491788 |
| Kurtosis | 3.6785662 |
| Mean | 30.265145 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 1.7456849 |
| Sum | 3580064 |
| Variance | 262.09561 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 18363 | 15.4% |
| 20 | 10999 | 9.2% |
| 30 | 7951 | 6.7% |
| 17 | 6202 | 5.2% |
| 18 | 5909 | 5.0% |
| 19 | 4898 | 4.1% |
| 25 | 4871 | 4.1% |
| 40 | 4360 | 3.7% |
| 22 | 4000 | 3.4% |
| 50 | 3163 | 2.7% |
| Other values (89) | 47574 |
| Value | Count | Frequency (%) |
| 7 | 32 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 4 | < 0.1% |
| 10 | 8 | < 0.1% |
| 11 | 96 | 0.1% |
| 12 | 41 | < 0.1% |
| 13 | 60 | 0.1% |
| 14 | 138 | 0.1% |
| 15 | 220 | 0.2% |
| 16 | 18363 |
| Value | Count | Frequency (%) |
| 105 | 335 | |
| 104 | 35 | < 0.1% |
| 103 | 46 | < 0.1% |
| 102 | 60 | 0.1% |
| 101 | 108 | 0.1% |
| 100 | 429 | |
| 99 | 36 | < 0.1% |
| 98 | 50 | < 0.1% |
| 97 | 11 | < 0.1% |
| 96 | 8 | < 0.1% |
product_height_cm
Real number (ℝ)
| Distinct | 102 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 853 |
| Missing (%) | 0.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.619706 |
| Minimum | 2 |
|---|---|
| Maximum | 105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 8 |
| median | 13 |
| Q3 | 20 |
| 95-th percentile | 45 |
| Maximum | 105 |
| Range | 103 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 13.453584 |
|---|---|
| Coefficient of variation (CV) | 0.80949592 |
| Kurtosis | 7.2778781 |
| Mean | 16.619706 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 2.2389625 |
| Sum | 1965945 |
| Variance | 180.99892 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 10374 | 8.7% |
| 20 | 6915 | 5.8% |
| 15 | 6896 | 5.8% |
| 12 | 6520 | 5.5% |
| 11 | 6432 | 5.4% |
| 2 | 5254 | 4.4% |
| 4 | 4910 | 4.1% |
| 8 | 4873 | 4.1% |
| 5 | 4776 | 4.0% |
| 16 | 4765 | 4.0% |
| Other values (92) | 56575 |
| Value | Count | Frequency (%) |
| 2 | 5254 | |
| 3 | 2821 | 2.4% |
| 4 | 4910 | |
| 5 | 4776 | |
| 6 | 3576 | 3.0% |
| 7 | 4387 | |
| 8 | 4873 | |
| 9 | 3408 | 2.9% |
| 10 | 10374 | |
| 11 | 6432 |
| Value | Count | Frequency (%) |
| 105 | 139 | |
| 104 | 14 | < 0.1% |
| 103 | 49 | < 0.1% |
| 102 | 10 | < 0.1% |
| 100 | 43 | < 0.1% |
| 99 | 5 | < 0.1% |
| 98 | 3 | < 0.1% |
| 97 | 2 | < 0.1% |
| 96 | 8 | < 0.1% |
| 95 | 22 | < 0.1% |
product_width_cm
Real number (ℝ)
| Distinct | 95 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 853 |
| Missing (%) | 0.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.074799 |
| Minimum | 6 |
|---|---|
| Maximum | 118 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 15 |
| median | 20 |
| Q3 | 30 |
| 95-th percentile | 45 |
| Maximum | 118 |
| Range | 112 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 11.749139 |
|---|---|
| Coefficient of variation (CV) | 0.50917622 |
| Kurtosis | 4.5530162 |
| Mean | 23.074799 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.707171 |
| Sum | 2729518 |
| Variance | 138.04227 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 12701 | 10.7% |
| 11 | 11144 | 9.4% |
| 15 | 9376 | 7.9% |
| 16 | 8810 | 7.4% |
| 30 | 8045 | 6.8% |
| 12 | 5711 | 4.8% |
| 13 | 5491 | 4.6% |
| 14 | 4846 | 4.1% |
| 18 | 4192 | 3.5% |
| 40 | 4157 | 3.5% |
| Other values (85) | 43817 |
| Value | Count | Frequency (%) |
| 6 | 2 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 29 | < 0.1% |
| 9 | 51 | < 0.1% |
| 10 | 83 | 0.1% |
| 11 | 11144 | |
| 12 | 5711 | |
| 13 | 5491 | |
| 14 | 4846 | |
| 15 | 9376 |
| Value | Count | Frequency (%) |
| 118 | 8 | < 0.1% |
| 105 | 14 | < 0.1% |
| 104 | 1 | < 0.1% |
| 103 | 1 | < 0.1% |
| 102 | 2 | < 0.1% |
| 101 | 2 | < 0.1% |
| 100 | 43 | |
| 98 | 1 | < 0.1% |
| 97 | 1 | < 0.1% |
| 95 | 2 | < 0.1% |
| Distinct | 96096 |
|---|---|
| Distinct (%) | 80.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3812576 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 81601 ? |
|---|---|
| Unique (%) | 68.5% |
Sample
| 1st row | 7c396fd4830fd04220f754e42b4e5bff |
|---|---|
| 2nd row | 7c396fd4830fd04220f754e42b4e5bff |
| 3rd row | 7c396fd4830fd04220f754e42b4e5bff |
| 4th row | 3a51803cc0d012c3b5dc8b7528cb05f7 |
| 5th row | ef0996a1a279c26e7ecbd737be23d235 |
| Value | Count | Frequency (%) |
| 9a736b248f67d166d2fbb006bcb877c3 | 75 | 0.1% |
| 6fbc7cdadbb522125f4b27ae9dee4060 | 38 | < 0.1% |
| f9ae226291893fda10af7965268fb7f6 | 35 | < 0.1% |
| 8af7ac63b2efbcbd88e5b11505e8098a | 29 | < 0.1% |
| 569aa12b73b5f7edeaa6f2a01603e381 | 26 | < 0.1% |
| 85963fd37bfd387aa6d915d8a1065486 | 24 | < 0.1% |
| db1af3fd6b23ac3873ef02619d548f9c | 24 | < 0.1% |
| 1d2435aa3b858d45c707c9fc25e18779 | 24 | < 0.1% |
| 5419a7c9b86a43d8140e2939cd2c2f7e | 24 | < 0.1% |
| c8460e4251689ba205045f3ea17884a1 | 24 | < 0.1% |
| Other values (96086) | 118820 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 239302 | 6.3% |
| b | 238901 | 6.3% |
| 1 | 238720 | 6.3% |
| a | 238717 | 6.3% |
| d | 238546 | 6.3% |
| 3 | 238474 | 6.3% |
| 8 | 238431 | 6.3% |
| e | 238285 | 6.2% |
| 5 | 238244 | 6.2% |
| 2 | 238230 | 6.2% |
| Other values (6) | 1426726 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2382587 | |
| Lowercase Letter | 1429989 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 239302 | |
| 1 | 238720 | |
| 3 | 238474 | |
| 8 | 238431 | |
| 5 | 238244 | |
| 2 | 238230 | |
| 9 | 238131 | |
| 7 | 238000 | |
| 0 | 237831 | |
| 4 | 237224 |
Lowercase Letter
| Value | Count | Frequency (%) |
| b | 238901 | |
| a | 238717 | |
| d | 238546 | |
| e | 238285 | |
| f | 238017 | |
| c | 237523 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2382587 | |
| Latin | 1429989 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 239302 | |
| 1 | 238720 | |
| 3 | 238474 | |
| 8 | 238431 | |
| 5 | 238244 | |
| 2 | 238230 | |
| 9 | 238131 | |
| 7 | 238000 | |
| 0 | 237831 | |
| 4 | 237224 |
Latin
| Value | Count | Frequency (%) |
| b | 238901 | |
| a | 238717 | |
| d | 238546 | |
| e | 238285 | |
| f | 238017 | |
| c | 237523 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3812576 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 239302 | 6.3% |
| b | 238901 | 6.3% |
| 1 | 238720 | 6.3% |
| a | 238717 | 6.3% |
| d | 238546 | 6.3% |
| 3 | 238474 | 6.3% |
| 8 | 238431 | 6.3% |
| e | 238285 | 6.2% |
| 5 | 238244 | 6.2% |
| 2 | 238230 | 6.2% |
| Other values (6) | 1426726 |
customer_zip_code_prefix
Real number (ℝ)
| Distinct | 14994 |
|---|---|
| Distinct (%) | 12.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35033.451 |
| Minimum | 1003 |
|---|---|
| Maximum | 99990 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 1003 |
|---|---|
| 5-th percentile | 3275 |
| Q1 | 11250 |
| median | 24240 |
| Q3 | 58475 |
| 95-th percentile | 90570 |
| Maximum | 99990 |
| Range | 98987 |
| Interquartile range (IQR) | 47225 |
Descriptive statistics
| Standard deviation | 29823.199 |
|---|---|
| Coefficient of variation (CV) | 0.85127779 |
| Kurtosis | -0.78102574 |
| Mean | 35033.451 |
| Median Absolute Deviation (MAD) | 16230 |
| Skewness | 0.7854738 |
| Sum | 4.1739905 × 109 |
| Variance | 8.894232 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24220 | 164 | 0.1% |
| 22790 | 155 | 0.1% |
| 22793 | 154 | 0.1% |
| 24230 | 141 | 0.1% |
| 22775 | 130 | 0.1% |
| 35162 | 125 | 0.1% |
| 29101 | 119 | 0.1% |
| 11740 | 111 | 0.1% |
| 13087 | 108 | 0.1% |
| 38400 | 106 | 0.1% |
| Other values (14984) | 117830 |
| Value | Count | Frequency (%) |
| 1003 | 1 | < 0.1% |
| 1004 | 2 | < 0.1% |
| 1005 | 6 | |
| 1006 | 2 | < 0.1% |
| 1007 | 4 | |
| 1008 | 4 | |
| 1009 | 8 | |
| 1011 | 6 | |
| 1012 | 3 | < 0.1% |
| 1013 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 99990 | 1 | < 0.1% |
| 99980 | 3 | < 0.1% |
| 99970 | 1 | < 0.1% |
| 99965 | 2 | < 0.1% |
| 99960 | 2 | < 0.1% |
| 99955 | 3 | < 0.1% |
| 99950 | 9 | |
| 99940 | 2 | < 0.1% |
| 99930 | 5 | |
| 99925 | 1 | < 0.1% |
customer_city
Text
| Distinct | 4119 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 10.33532 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1231381 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1036 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | sao paulo |
|---|---|
| 2nd row | sao paulo |
| 3rd row | sao paulo |
| 4th row | sao paulo |
| 5th row | sao paulo |
| Value | Count | Frequency (%) |
| sao | 25445 | 12.2% |
| paulo | 18957 | 9.1% |
| de | 11657 | 5.6% |
| rio | 9967 | 4.8% |
| janeiro | 8311 | 4.0% |
| do | 5095 | 2.4% |
| belo | 3373 | 1.6% |
| horizonte | 3327 | 1.6% |
| brasilia | 2510 | 1.2% |
| porto | 1998 | 1.0% |
| Other values (3285) | 118347 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 203082 | |
| o | 151991 | |
| i | 94150 | 7.6% |
| r | 91236 | 7.4% |
| 89844 | 7.3% | |
| e | 79953 | 6.5% |
| s | 75446 | 6.1% |
| n | 54566 | 4.4% |
| u | 54064 | 4.4% |
| l | 53676 | 4.4% |
| Other values (21) | 283373 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1140982 | |
| Space Separator | 89844 | 7.3% |
| Dash Punctuation | 290 | < 0.1% |
| Other Punctuation | 263 | < 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 203082 | |
| o | 151991 | |
| i | 94150 | 8.3% |
| r | 91236 | 8.0% |
| e | 79953 | 7.0% |
| s | 75446 | 6.6% |
| n | 54566 | 4.8% |
| u | 54064 | 4.7% |
| l | 53676 | 4.7% |
| p | 44782 | 3.9% |
| Other values (16) | 238036 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 4 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 89844 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 290 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 263 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1140982 | |
| Common | 90399 | 7.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 203082 | |
| o | 151991 | |
| i | 94150 | 8.3% |
| r | 91236 | 8.0% |
| e | 79953 | 7.0% |
| s | 75446 | 6.6% |
| n | 54566 | 4.8% |
| u | 54064 | 4.7% |
| l | 53676 | 4.7% |
| p | 44782 | 3.9% |
| Other values (16) | 238036 |
Common
| Value | Count | Frequency (%) |
| 89844 | ||
| - | 290 | 0.3% |
| ' | 263 | 0.3% |
| 1 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1231381 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 203082 | |
| o | 151991 | |
| i | 94150 | 7.6% |
| r | 91236 | 7.4% |
| 89844 | 7.3% | |
| e | 79953 | 6.5% |
| s | 75446 | 6.1% |
| n | 54566 | 4.4% |
| u | 54064 | 4.4% |
| l | 53676 | 4.4% |
| Other values (21) | 283373 |
customer_state
Categorical
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| SP | |
|---|---|
| RJ | |
| MG | |
| RS | |
| PR | |
| Other values (22) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 238286 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SP |
|---|---|
| 2nd row | SP |
| 3rd row | SP |
| 4th row | SP |
| 5th row | SP |
Common Values
| Value | Count | Frequency (%) |
| SP | 50265 | |
| RJ | 15518 | 13.0% |
| MG | 13819 | 11.6% |
| RS | 6573 | 5.5% |
| PR | 6043 | 5.1% |
| SC | 4345 | 3.6% |
| BA | 4091 | 3.4% |
| DF | 2516 | 2.1% |
| GO | 2466 | 2.1% |
| ES | 2360 | 2.0% |
| Other values (17) | 11147 | 9.4% |
Length
| Value | Count | Frequency (%) |
| sp | 50265 | |
| rj | 15518 | 13.0% |
| mg | 13819 | 11.6% |
| rs | 6573 | 5.5% |
| pr | 6043 | 5.1% |
| sc | 4345 | 3.6% |
| ba | 4091 | 3.4% |
| df | 2516 | 2.1% |
| go | 2466 | 2.1% |
| es | 2360 | 2.0% |
| Other values (17) | 11147 | 9.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 64808 | |
| P | 60647 | |
| R | 29104 | |
| M | 16842 | 7.1% |
| G | 16285 | 6.8% |
| J | 15518 | 6.5% |
| A | 6892 | 2.9% |
| E | 6234 | 2.6% |
| C | 6005 | 2.5% |
| B | 4735 | 2.0% |
| Other values (7) | 11216 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 238286 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 64808 | |
| P | 60647 | |
| R | 29104 | |
| M | 16842 | 7.1% |
| G | 16285 | 6.8% |
| J | 15518 | 6.5% |
| A | 6892 | 2.9% |
| E | 6234 | 2.6% |
| C | 6005 | 2.5% |
| B | 4735 | 2.0% |
| Other values (7) | 11216 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 238286 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 64808 | |
| P | 60647 | |
| R | 29104 | |
| M | 16842 | 7.1% |
| G | 16285 | 6.8% |
| J | 15518 | 6.5% |
| A | 6892 | 2.9% |
| E | 6234 | 2.6% |
| C | 6005 | 2.5% |
| B | 4735 | 2.0% |
| Other values (7) | 11216 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 238286 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 64808 | |
| P | 60647 | |
| R | 29104 | |
| M | 16842 | 7.1% |
| G | 16285 | 6.8% |
| J | 15518 | 6.5% |
| A | 6892 | 2.9% |
| E | 6234 | 2.6% |
| C | 6005 | 2.5% |
| B | 4735 | 2.0% |
| Other values (7) | 11216 | 4.7% |
seller_zip_code_prefix
Real number (ℝ)
| Distinct | 2246 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 833 |
| Missing (%) | 0.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24442.41 |
| Minimum | 1001 |
|---|---|
| Maximum | 99730 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 1001 |
|---|---|
| 5-th percentile | 2972 |
| Q1 | 6429 |
| median | 13660 |
| Q3 | 27972 |
| 95-th percentile | 88308 |
| Maximum | 99730 |
| Range | 98729 |
| Interquartile range (IQR) | 21543 |
Descriptive statistics
| Standard deviation | 27573.005 |
|---|---|
| Coefficient of variation (CV) | 1.1280804 |
| Kurtosis | 0.93697995 |
| Mean | 24442.41 |
| Median Absolute Deviation (MAD) | 8123 |
| Skewness | 1.5561361 |
| Sum | 2.8917816 × 109 |
| Variance | 7.6027058 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14940 | 8373 | 7.0% |
| 5849 | 2145 | 1.8% |
| 15025 | 2098 | 1.8% |
| 9015 | 1899 | 1.6% |
| 13405 | 1678 | 1.4% |
| 8577 | 1556 | 1.3% |
| 4782 | 1549 | 1.3% |
| 3204 | 1477 | 1.2% |
| 4160 | 1268 | 1.1% |
| 13232 | 1255 | 1.1% |
| Other values (2236) | 95012 |
| Value | Count | Frequency (%) |
| 1001 | 22 | < 0.1% |
| 1021 | 41 | < 0.1% |
| 1022 | 5 | < 0.1% |
| 1023 | 5 | < 0.1% |
| 1026 | 323 | |
| 1031 | 129 | 0.1% |
| 1035 | 18 | < 0.1% |
| 1039 | 1 | < 0.1% |
| 1040 | 25 | < 0.1% |
| 1041 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 99730 | 12 | < 0.1% |
| 99700 | 2 | < 0.1% |
| 99670 | 1 | < 0.1% |
| 99500 | 61 | |
| 99300 | 2 | < 0.1% |
| 98975 | 22 | < 0.1% |
| 98920 | 2 | < 0.1% |
| 98910 | 14 | < 0.1% |
| 98803 | 66 | |
| 98780 | 4 | < 0.1% |
seller_city
Text
| Distinct | 611 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 833 |
| Missing (%) | 0.7% |
| Memory size | 1.8 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 31 |
| Mean length | 10.102451 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1195221 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 9 ? |
| Distinct scripts | 3 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 64 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | maua |
|---|---|
| 2nd row | maua |
| 3rd row | maua |
| 4th row | maua |
| 5th row | maua |
| Value | Count | Frequency (%) |
| sao | 36362 | 17.9% |
| paulo | 29574 | 14.5% |
| ibitinga | 8373 | 4.1% |
| rio | 5930 | 2.9% |
| do | 5524 | 2.7% |
| preto | 5518 | 2.7% |
| de | 4192 | 2.1% |
| jose | 4085 | 2.0% |
| santo | 3270 | 1.6% |
| andre | 3164 | 1.6% |
| Other values (640) | 97267 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 198772 | |
| o | 146215 | |
| i | 102111 | 8.5% |
| 85009 | 7.1% | |
| r | 78220 | 6.5% |
| s | 76216 | 6.4% |
| e | 64170 | 5.4% |
| u | 62907 | 5.3% |
| p | 58419 | 4.9% |
| l | 56917 | 4.8% |
| Other values (31) | 266265 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1108993 | |
| Space Separator | 85009 | 7.1% |
| Other Punctuation | 614 | 0.1% |
| Modifier Symbol | 369 | < 0.1% |
| Dash Punctuation | 164 | < 0.1% |
| Close Punctuation | 31 | < 0.1% |
| Open Punctuation | 31 | < 0.1% |
| Decimal Number | 8 | < 0.1% |
| Nonspacing Mark | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 198772 | |
| o | 146215 | |
| i | 102111 | |
| r | 78220 | 7.1% |
| s | 76216 | 6.9% |
| e | 64170 | 5.8% |
| u | 62907 | 5.7% |
| p | 58419 | 5.3% |
| l | 56917 | 5.1% |
| t | 47317 | 4.3% |
| Other values (14) | 217729 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 347 | |
| / | 141 | |
| . | 76 | 12.4% |
| @ | 38 | 6.2% |
| \ | 6 | 1.0% |
| , | 6 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 2 | 2 | |
| 5 | 2 | |
| 0 | 1 | |
| 8 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 85009 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 369 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 164 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 31 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 31 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ̃ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1108993 | |
| Common | 86226 | 7.2% |
| Inherited | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 198772 | |
| o | 146215 | |
| i | 102111 | |
| r | 78220 | 7.1% |
| s | 76216 | 6.9% |
| e | 64170 | 5.8% |
| u | 62907 | 5.7% |
| p | 58419 | 5.3% |
| l | 56917 | 5.1% |
| t | 47317 | 4.3% |
| Other values (14) | 217729 |
Common
| Value | Count | Frequency (%) |
| 85009 | ||
| ´ | 369 | 0.4% |
| ' | 347 | 0.4% |
| - | 164 | 0.2% |
| / | 141 | 0.2% |
| . | 76 | 0.1% |
| @ | 38 | < 0.1% |
| ) | 31 | < 0.1% |
| ( | 31 | < 0.1% |
| \ | 6 | < 0.1% |
| Other values (6) | 14 | < 0.1% |
Inherited
| Value | Count | Frequency (%) |
| ̃ | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1194850 | |
| None | 369 | < 0.1% |
| Diacriticals | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 198772 | |
| o | 146215 | |
| i | 102111 | 8.5% |
| 85009 | 7.1% | |
| r | 78220 | 6.5% |
| s | 76216 | 6.4% |
| e | 64170 | 5.4% |
| u | 62907 | 5.3% |
| p | 58419 | 4.9% |
| l | 56917 | 4.8% |
| Other values (29) | 265894 |
None
| Value | Count | Frequency (%) |
| ´ | 369 |
Diacriticals
| Value | Count | Frequency (%) |
| ̃ | 2 |
seller_state
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 23 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 833 |
| Missing (%) | 0.7% |
| Memory size | 1.8 MiB |
| SP | |
|---|---|
| MG | |
| PR | |
| RJ | 5036 |
| SC | 4271 |
| Other values (18) | 6216 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 236620 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | SP |
|---|---|
| 2nd row | SP |
| 3rd row | SP |
| 4th row | SP |
| 5th row | SP |
Common Values
| Value | Count | Frequency (%) |
| SP | 84377 | |
| MG | 9314 | 7.8% |
| PR | 9096 | 7.6% |
| RJ | 5036 | 4.2% |
| SC | 4271 | 3.6% |
| RS | 2294 | 1.9% |
| DF | 949 | 0.8% |
| BA | 700 | 0.6% |
| GO | 550 | 0.5% |
| PE | 465 | 0.4% |
| Other values (13) | 1258 | 1.1% |
| (Missing) | 833 | 0.7% |
Length
| Value | Count | Frequency (%) |
| sp | 84377 | |
| mg | 9314 | 7.9% |
| pr | 9096 | 7.7% |
| rj | 5036 | 4.3% |
| sc | 4271 | 3.6% |
| rs | 2294 | 1.9% |
| df | 949 | 0.8% |
| ba | 700 | 0.6% |
| go | 550 | 0.5% |
| pe | 465 | 0.4% |
| Other values (13) | 1258 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 94002 | |
| S | 91402 | |
| R | 16496 | 7.0% |
| M | 9934 | 4.2% |
| G | 9864 | 4.2% |
| J | 5036 | 2.1% |
| C | 4375 | 1.8% |
| A | 1122 | 0.5% |
| E | 968 | 0.4% |
| D | 949 | 0.4% |
| Other values (6) | 2472 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 236620 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 94002 | |
| S | 91402 | |
| R | 16496 | 7.0% |
| M | 9934 | 4.2% |
| G | 9864 | 4.2% |
| J | 5036 | 2.1% |
| C | 4375 | 1.8% |
| A | 1122 | 0.5% |
| E | 968 | 0.4% |
| D | 949 | 0.4% |
| Other values (6) | 2472 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 236620 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 94002 | |
| S | 91402 | |
| R | 16496 | 7.0% |
| M | 9934 | 4.2% |
| G | 9864 | 4.2% |
| J | 5036 | 2.1% |
| C | 4375 | 1.8% |
| A | 1122 | 0.5% |
| E | 968 | 0.4% |
| D | 949 | 0.4% |
| Other values (6) | 2472 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 236620 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 94002 | |
| S | 91402 | |
| R | 16496 | 7.0% |
| M | 9934 | 4.2% |
| G | 9864 | 4.2% |
| J | 5036 | 2.1% |
| C | 4375 | 1.8% |
| A | 1122 | 0.5% |
| E | 968 | 0.4% |
| D | 949 | 0.4% |
| Other values (6) | 2472 | 1.0% |
| order_item_id | price | freight_value | payment_sequential | payment_installments | payment_value | product_name_lenght | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | customer_zip_code_prefix | seller_zip_code_prefix | order_status | payment_type | review_score | customer_state | seller_state | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| order_item_id | 1.000 | -0.116 | -0.056 | -0.008 | 0.061 | 0.257 | -0.021 | -0.032 | -0.065 | 0.000 | 0.007 | 0.018 | -0.004 | -0.009 | -0.011 | 0.002 | 0.021 | 0.041 | 0.013 | 0.000 |
| price | -0.116 | 1.000 | 0.433 | -0.005 | 0.315 | 0.789 | 0.042 | 0.211 | 0.029 | 0.514 | 0.266 | 0.327 | 0.271 | 0.070 | 0.175 | 0.014 | 0.014 | 0.012 | 0.020 | 0.052 |
| freight_value | -0.056 | 0.433 | 1.000 | 0.017 | 0.191 | 0.423 | 0.034 | 0.117 | 0.011 | 0.448 | 0.284 | 0.284 | 0.275 | 0.466 | 0.257 | 0.015 | 0.009 | 0.015 | 0.085 | 0.048 |
| payment_sequential | -0.008 | -0.005 | 0.017 | 1.000 | -0.178 | -0.215 | -0.003 | -0.013 | -0.005 | 0.030 | 0.034 | 0.013 | 0.028 | -0.009 | 0.007 | 0.026 | 0.198 | 0.012 | 0.026 | 0.016 |
| payment_installments | 0.061 | 0.315 | 0.191 | -0.178 | 1.000 | 0.395 | 0.016 | 0.033 | -0.003 | 0.198 | 0.109 | 0.106 | 0.125 | 0.069 | 0.065 | 0.005 | 0.236 | 0.027 | 0.032 | 0.033 |
| payment_value | 0.257 | 0.789 | 0.423 | -0.215 | 0.395 | 1.000 | 0.025 | 0.169 | -0.011 | 0.449 | 0.229 | 0.305 | 0.232 | 0.106 | 0.160 | 0.015 | 0.018 | 0.028 | 0.029 | 0.036 |
| product_name_lenght | -0.021 | 0.042 | 0.034 | -0.003 | 0.016 | 0.025 | 1.000 | 0.073 | 0.163 | 0.076 | 0.060 | -0.057 | 0.065 | 0.015 | 0.009 | 0.019 | 0.011 | 0.013 | 0.013 | 0.070 |
| product_description_lenght | -0.032 | 0.211 | 0.117 | -0.013 | 0.033 | 0.169 | 0.073 | 1.000 | 0.111 | 0.095 | -0.021 | 0.135 | -0.081 | 0.031 | 0.001 | 0.016 | 0.020 | 0.015 | 0.026 | 0.112 |
| product_photos_qty | -0.065 | 0.029 | 0.011 | -0.005 | -0.003 | -0.011 | 0.163 | 0.111 | 1.000 | 0.003 | 0.005 | -0.079 | -0.015 | 0.026 | -0.078 | 0.013 | 0.004 | 0.016 | 0.014 | 0.040 |
| product_weight_g | 0.000 | 0.514 | 0.448 | 0.030 | 0.198 | 0.449 | 0.076 | 0.095 | 0.003 | 1.000 | 0.619 | 0.532 | 0.621 | 0.026 | 0.096 | 0.011 | 0.018 | 0.020 | 0.028 | 0.078 |
| product_length_cm | 0.007 | 0.266 | 0.284 | 0.034 | 0.109 | 0.229 | 0.060 | -0.021 | 0.005 | 0.619 | 1.000 | 0.248 | 0.632 | 0.008 | 0.067 | 0.014 | 0.022 | 0.018 | 0.017 | 0.084 |
| product_height_cm | 0.018 | 0.327 | 0.284 | 0.013 | 0.106 | 0.305 | -0.057 | 0.135 | -0.079 | 0.532 | 0.248 | 1.000 | 0.338 | 0.019 | 0.049 | 0.016 | 0.015 | 0.018 | 0.019 | 0.065 |
| product_width_cm | -0.004 | 0.271 | 0.275 | 0.028 | 0.125 | 0.232 | 0.065 | -0.081 | -0.015 | 0.621 | 0.632 | 0.338 | 1.000 | -0.002 | 0.077 | 0.004 | 0.020 | 0.014 | 0.016 | 0.057 |
| customer_zip_code_prefix | -0.009 | 0.070 | 0.466 | -0.009 | 0.069 | 0.106 | 0.015 | 0.031 | 0.026 | 0.026 | 0.008 | 0.019 | -0.002 | 1.000 | 0.060 | 0.022 | 0.029 | 0.041 | 0.896 | 0.065 |
| seller_zip_code_prefix | -0.011 | 0.175 | 0.257 | 0.007 | 0.065 | 0.160 | 0.009 | 0.001 | -0.078 | 0.096 | 0.067 | 0.049 | 0.077 | 0.060 | 1.000 | 0.013 | 0.018 | 0.021 | 0.069 | 0.920 |
| order_status | 0.002 | 0.014 | 0.015 | 0.026 | 0.005 | 0.015 | 0.019 | 0.016 | 0.013 | 0.011 | 0.014 | 0.016 | 0.004 | 0.022 | 0.013 | 1.000 | 0.037 | 0.149 | 0.026 | 0.029 |
| payment_type | 0.021 | 0.014 | 0.009 | 0.198 | 0.236 | 0.018 | 0.011 | 0.020 | 0.004 | 0.018 | 0.022 | 0.015 | 0.020 | 0.029 | 0.018 | 0.037 | 1.000 | 0.010 | 0.033 | 0.021 |
| review_score | 0.041 | 0.012 | 0.015 | 0.012 | 0.027 | 0.028 | 0.013 | 0.015 | 0.016 | 0.020 | 0.018 | 0.018 | 0.014 | 0.041 | 0.021 | 0.149 | 0.010 | 1.000 | 0.048 | 0.023 |
| customer_state | 0.013 | 0.020 | 0.085 | 0.026 | 0.032 | 0.029 | 0.013 | 0.026 | 0.014 | 0.028 | 0.017 | 0.019 | 0.016 | 0.896 | 0.069 | 0.026 | 0.033 | 0.048 | 1.000 | 0.053 |
| seller_state | 0.000 | 0.052 | 0.048 | 0.016 | 0.033 | 0.036 | 0.070 | 0.112 | 0.040 | 0.078 | 0.084 | 0.065 | 0.057 | 0.065 | 0.920 | 0.029 | 0.021 | 0.023 | 0.053 | 1.000 |
| order_id | customer_id | order_status | order_purchase_timestamp | order_approved_at | order_delivered_carrier_date | order_delivered_customer_date | order_estimated_delivery_date | order_item_id | product_id | seller_id | shipping_limit_date | price | freight_value | payment_sequential | payment_type | payment_installments | payment_value | review_id | review_score | review_comment_title | review_comment_message | review_creation_date | review_answer_timestamp | product_category_name | product_name_lenght | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | customer_unique_id | customer_zip_code_prefix | customer_city | customer_state | seller_zip_code_prefix | seller_city | seller_state | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | e481f51cbdc54678b7cc49136f2d6af7 | 9ef432eb6251297304e76186b10a928d | delivered | 2017-10-02 10:56:33 | 2017-10-02 11:07:15 | 2017-10-04 19:55:00 | 2017-10-10 21:25:13 | 2017-10-18 00:00:00 | 1.0 | 87285b34884572647811a353c7ac498a | 3504c0cb71d7fa48d967e0e4c94d59d9 | 2017-10-06 11:07:15 | 29.99 | 8.72 | 1.0 | credit_card | 1.0 | 18.12 | a54f0611adc9ed256b57ede6b6eb5114 | 4.0 | NaN | Não testei o produto ainda, mas ele veio correto e em boas condições. Apenas a caixa que veio bem amassada e danificada, o que ficará chato, pois se trata de um presente. | 2017-10-11 00:00:00 | 2017-10-12 03:43:48 | utilidades_domesticas | 40.0 | 268.0 | 4.0 | 500.0 | 19.0 | 8.0 | 13.0 | 7c396fd4830fd04220f754e42b4e5bff | 3149 | sao paulo | SP | 9350.0 | maua | SP |
| 1 | e481f51cbdc54678b7cc49136f2d6af7 | 9ef432eb6251297304e76186b10a928d | delivered | 2017-10-02 10:56:33 | 2017-10-02 11:07:15 | 2017-10-04 19:55:00 | 2017-10-10 21:25:13 | 2017-10-18 00:00:00 | 1.0 | 87285b34884572647811a353c7ac498a | 3504c0cb71d7fa48d967e0e4c94d59d9 | 2017-10-06 11:07:15 | 29.99 | 8.72 | 3.0 | voucher | 1.0 | 2.00 | a54f0611adc9ed256b57ede6b6eb5114 | 4.0 | NaN | Não testei o produto ainda, mas ele veio correto e em boas condições. Apenas a caixa que veio bem amassada e danificada, o que ficará chato, pois se trata de um presente. | 2017-10-11 00:00:00 | 2017-10-12 03:43:48 | utilidades_domesticas | 40.0 | 268.0 | 4.0 | 500.0 | 19.0 | 8.0 | 13.0 | 7c396fd4830fd04220f754e42b4e5bff | 3149 | sao paulo | SP | 9350.0 | maua | SP |
| 2 | e481f51cbdc54678b7cc49136f2d6af7 | 9ef432eb6251297304e76186b10a928d | delivered | 2017-10-02 10:56:33 | 2017-10-02 11:07:15 | 2017-10-04 19:55:00 | 2017-10-10 21:25:13 | 2017-10-18 00:00:00 | 1.0 | 87285b34884572647811a353c7ac498a | 3504c0cb71d7fa48d967e0e4c94d59d9 | 2017-10-06 11:07:15 | 29.99 | 8.72 | 2.0 | voucher | 1.0 | 18.59 | a54f0611adc9ed256b57ede6b6eb5114 | 4.0 | NaN | Não testei o produto ainda, mas ele veio correto e em boas condições. Apenas a caixa que veio bem amassada e danificada, o que ficará chato, pois se trata de um presente. | 2017-10-11 00:00:00 | 2017-10-12 03:43:48 | utilidades_domesticas | 40.0 | 268.0 | 4.0 | 500.0 | 19.0 | 8.0 | 13.0 | 7c396fd4830fd04220f754e42b4e5bff | 3149 | sao paulo | SP | 9350.0 | maua | SP |
| 3 | 128e10d95713541c87cd1a2e48201934 | a20e8105f23924cd00833fd87daa0831 | delivered | 2017-08-15 18:29:31 | 2017-08-15 20:05:16 | 2017-08-17 15:28:33 | 2017-08-18 14:44:43 | 2017-08-28 00:00:00 | 1.0 | 87285b34884572647811a353c7ac498a | 3504c0cb71d7fa48d967e0e4c94d59d9 | 2017-08-21 20:05:16 | 29.99 | 7.78 | 1.0 | credit_card | 3.0 | 37.77 | b46f1e34512b0f4c74a72398b03ca788 | 4.0 | NaN | Deveriam embalar melhor o produto. A caixa veio toda amassada e vou dar de presente. | 2017-08-19 00:00:00 | 2017-08-20 15:16:36 | utilidades_domesticas | 40.0 | 268.0 | 4.0 | 500.0 | 19.0 | 8.0 | 13.0 | 3a51803cc0d012c3b5dc8b7528cb05f7 | 3366 | sao paulo | SP | 9350.0 | maua | SP |
| 4 | 0e7e841ddf8f8f2de2bad69267ecfbcf | 26c7ac168e1433912a51b924fbd34d34 | delivered | 2017-08-02 18:24:47 | 2017-08-02 18:43:15 | 2017-08-04 17:35:43 | 2017-08-07 18:30:01 | 2017-08-15 00:00:00 | 1.0 | 87285b34884572647811a353c7ac498a | 3504c0cb71d7fa48d967e0e4c94d59d9 | 2017-08-08 18:37:31 | 29.99 | 7.78 | 1.0 | credit_card | 1.0 | 37.77 | dc90f19c2806f1abba9e72ad3c350073 | 5.0 | NaN | Só achei ela pequena pra seis xícaras ,mais é um bom produto\r\n | 2017-08-08 00:00:00 | 2017-08-08 23:26:23 | utilidades_domesticas | 40.0 | 268.0 | 4.0 | 500.0 | 19.0 | 8.0 | 13.0 | ef0996a1a279c26e7ecbd737be23d235 | 2290 | sao paulo | SP | 9350.0 | maua | SP |
| 5 | bfc39df4f36c3693ff3b63fcbea9e90a | 53904ddbea91e1e92b2b3f1d09a7af86 | delivered | 2017-10-23 23:26:46 | 2017-10-25 02:14:11 | 2017-10-27 16:48:46 | 2017-11-07 18:04:59 | 2017-11-13 00:00:00 | 1.0 | 87285b34884572647811a353c7ac498a | 3504c0cb71d7fa48d967e0e4c94d59d9 | 2017-10-31 02:14:11 | 29.99 | 14.10 | 1.0 | boleto | 1.0 | 44.09 | 1bafb430e498b939f258b9c9dbdff9b1 | 3.0 | NaN | NaN | 2017-11-08 00:00:00 | 2017-11-10 19:52:38 | utilidades_domesticas | 40.0 | 268.0 | 4.0 | 500.0 | 19.0 | 8.0 | 13.0 | e781fdcc107d13d865fc7698711cc572 | 88032 | florianopolis | SC | 9350.0 | maua | SP |
| 6 | 8736140c61ea584cb4250074756d8f3b | ab8844663ae049fda8baf15fc928f47f | delivered | 2017-08-10 13:35:55 | 2017-08-10 13:50:09 | 2017-08-11 13:52:35 | 2017-08-16 19:03:36 | 2017-08-23 00:00:00 | 1.0 | b00a32a0b42fd65efb58a5822009f629 | 3504c0cb71d7fa48d967e0e4c94d59d9 | 2017-08-16 13:50:09 | 75.90 | 7.79 | 1.0 | credit_card | 1.0 | 83.69 | b8238c6515192f8129081e17dc57d169 | 5.0 | NaN | custo beneficio, simples de usar e rápido | 2017-08-17 00:00:00 | 2017-08-21 12:43:27 | bebes | 58.0 | 398.0 | 3.0 | 238.0 | 20.0 | 10.0 | 15.0 | 02c9e0c05a817d4562ec0e8c90f29dba | 8577 | itaquaquecetuba | SP | 9350.0 | maua | SP |
| 7 | 88407c8c6e12493ff6e845df39540112 | e902cb9d9992a69a267f69dec57aa3a3 | delivered | 2017-08-15 02:03:01 | 2017-08-15 02:15:13 | 2017-08-16 15:52:29 | 2017-08-25 21:59:26 | 2017-08-28 00:00:00 | 1.0 | b00a32a0b42fd65efb58a5822009f629 | 3504c0cb71d7fa48d967e0e4c94d59d9 | 2017-08-21 02:15:13 | 75.90 | 7.79 | 1.0 | credit_card | 2.0 | 83.69 | 186b702b3817fd5cc00b201b11764d63 | 4.0 | NaN | muitro bom o produto chegou dentro do prazo. | 2017-08-26 00:00:00 | 2017-08-28 20:10:38 | bebes | 58.0 | 398.0 | 3.0 | 238.0 | 20.0 | 10.0 | 15.0 | 28adbfbaf0b9c5e5a0555a8c853a7534 | 13060 | campinas | SP | 9350.0 | maua | SP |
| 8 | 4f2acff0b7d2bcc4a408abe5a223d407 | d67b6cca5a87299f711a6961f579fe67 | delivered | 2017-08-01 16:31:35 | 2017-08-02 02:50:25 | 2017-08-03 14:36:34 | 2017-08-09 19:56:50 | 2017-08-23 00:00:00 | 1.0 | b00a32a0b42fd65efb58a5822009f629 | 3504c0cb71d7fa48d967e0e4c94d59d9 | 2017-08-08 02:50:25 | 75.90 | 14.28 | 1.0 | boleto | 1.0 | 90.18 | 567900cb1263f2ee7341989937a789cc | 5.0 | NaN | NaN | 2017-08-10 00:00:00 | 2017-08-11 21:08:38 | bebes | 58.0 | 398.0 | 3.0 | 238.0 | 20.0 | 10.0 | 15.0 | aea90564d6f09ae11bf936f55ed49d72 | 82030 | curitiba | PR | 9350.0 | maua | SP |
| 9 | 019aaee09698daf81dcffe9d94a18b5c | e3893e579755de4feb1a4d0313c103fa | delivered | 2017-08-10 14:04:58 | 2017-08-10 14:23:38 | 2017-08-11 13:52:35 | 2017-08-12 11:56:49 | 2017-08-23 00:00:00 | 1.0 | b00a32a0b42fd65efb58a5822009f629 | 3504c0cb71d7fa48d967e0e4c94d59d9 | 2017-08-16 14:23:38 | 75.90 | 7.79 | 1.0 | credit_card | 2.0 | 83.69 | 43334848a48a7abf6faa2f8aba675b8a | 2.0 | NaN | tudo correu bem com a loja e com a entrega mas o produto não funcionou, vou devolver | 2017-08-13 00:00:00 | 2017-08-14 12:24:58 | bebes | 58.0 | 398.0 | 3.0 | 238.0 | 20.0 | 10.0 | 15.0 | cd6b577df45c00daa6b2767eaa947c72 | 13092 | campinas | SP | 9350.0 | maua | SP |
| order_id | customer_id | order_status | order_purchase_timestamp | order_approved_at | order_delivered_carrier_date | order_delivered_customer_date | order_estimated_delivery_date | order_item_id | product_id | seller_id | shipping_limit_date | price | freight_value | payment_sequential | payment_type | payment_installments | payment_value | review_id | review_score | review_comment_title | review_comment_message | review_creation_date | review_answer_timestamp | product_category_name | product_name_lenght | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | customer_unique_id | customer_zip_code_prefix | customer_city | customer_state | seller_zip_code_prefix | seller_city | seller_state | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 119133 | f5f8998eee8ec7bc513dc52847d64ce0 | f4656b824844a039a87fd9c51ad3586a | canceled | 2018-03-01 11:42:23 | 2018-03-01 12:20:32 | NaN | NaN | 2018-03-20 00:00:00 | 1.0 | 51bd37bb8517d5bfdb1f54c11fb01d27 | f09e26011d833ddab11593c1a097a92a | 2018-03-08 12:20:32 | 79.90 | 22.19 | 1.0 | credit_card | 2.0 | 102.09 | bdf24af3e04cf534d9bee6afd037c1a0 | 1.0 | NaN | NaN | 2018-03-22 00:00:00 | 2018-03-26 03:00:14 | moveis_decoracao | 43.0 | 87.0 | 1.0 | 3500.0 | 20.0 | 20.0 | 20.0 | 149164aee69ed656dedbbe68623157bc | 13469 | americana | SP | 13632.0 | pirassununga | SP |
| 119134 | 5bacbd9f42bd029c3a296501224e193e | 5a1470d43d8ad960d4199134d3df48e0 | delivered | 2018-08-10 21:14:35 | 2018-08-10 21:25:22 | 2018-08-13 13:54:00 | 2018-08-21 04:16:31 | 2018-08-30 00:00:00 | 1.0 | 710e8b076db06c8e5343a9e23f0e3d83 | 8dd386be0767c330276ea6a3f96532d3 | 2018-08-15 21:25:22 | 44.99 | 22.25 | 1.0 | credit_card | 2.0 | 134.48 | f91f12b20162095a387a237e114e7d67 | 5.0 | NaN | NaN | 2018-08-21 00:00:00 | 2018-08-21 22:01:55 | esporte_lazer | 60.0 | 645.0 | 2.0 | 600.0 | 30.0 | 20.0 | 20.0 | 0b39f417a3c099ff0497346258e8d752 | 39810 | carai | MG | 88490.0 | paulo lopes | SC |
| 119135 | 5bacbd9f42bd029c3a296501224e193e | 5a1470d43d8ad960d4199134d3df48e0 | delivered | 2018-08-10 21:14:35 | 2018-08-10 21:25:22 | 2018-08-13 13:54:00 | 2018-08-21 04:16:31 | 2018-08-30 00:00:00 | 2.0 | 710e8b076db06c8e5343a9e23f0e3d83 | 8dd386be0767c330276ea6a3f96532d3 | 2018-08-15 21:25:22 | 44.99 | 22.25 | 1.0 | credit_card | 2.0 | 134.48 | f91f12b20162095a387a237e114e7d67 | 5.0 | NaN | NaN | 2018-08-21 00:00:00 | 2018-08-21 22:01:55 | esporte_lazer | 60.0 | 645.0 | 2.0 | 600.0 | 30.0 | 20.0 | 20.0 | 0b39f417a3c099ff0497346258e8d752 | 39810 | carai | MG | 88490.0 | paulo lopes | SC |
| 119136 | 5a8a4dc28b16fb90469ad749f9535773 | c0c8b8bb055100a0cc08dcc04d847ac9 | canceled | 2018-03-13 10:58:09 | 2018-03-14 03:08:35 | NaN | NaN | 2018-03-23 00:00:00 | 1.0 | 33ac889bc3af4ddede9c14fc789a3743 | 666658b8da8370f30e1f89893b1de5e6 | 2018-03-20 03:08:35 | 149.00 | 11.67 | 1.0 | boleto | 1.0 | 321.34 | ec03cc18869f8509f9d3fbe2d106cea7 | 5.0 | NaN | NaN | 2018-03-28 00:00:00 | 2018-03-28 18:11:45 | ferramentas_jardim | 28.0 | 682.0 | 1.0 | 1700.0 | 30.0 | 5.0 | 30.0 | 82ec5f749b66f1857e868b6414a67ab3 | 6765 | taboao da serra | SP | 3658.0 | sao paulo | SP |
| 119137 | 5a8a4dc28b16fb90469ad749f9535773 | c0c8b8bb055100a0cc08dcc04d847ac9 | canceled | 2018-03-13 10:58:09 | 2018-03-14 03:08:35 | NaN | NaN | 2018-03-23 00:00:00 | 2.0 | 33ac889bc3af4ddede9c14fc789a3743 | 666658b8da8370f30e1f89893b1de5e6 | 2018-03-20 03:08:35 | 149.00 | 11.67 | 1.0 | boleto | 1.0 | 321.34 | ec03cc18869f8509f9d3fbe2d106cea7 | 5.0 | NaN | NaN | 2018-03-28 00:00:00 | 2018-03-28 18:11:45 | ferramentas_jardim | 28.0 | 682.0 | 1.0 | 1700.0 | 30.0 | 5.0 | 30.0 | 82ec5f749b66f1857e868b6414a67ab3 | 6765 | taboao da serra | SP | 3658.0 | sao paulo | SP |
| 119138 | 1ab38815794efa43d269d62b98dae815 | a0b67404d84a70ef420a7f99ad6b190a | delivered | 2018-07-01 10:23:10 | 2018-07-05 16:17:52 | 2018-07-04 14:34:00 | 2018-07-09 15:06:57 | 2018-07-20 00:00:00 | 1.0 | 31ec3a565e06de4bdf9d2a511b822b4d | babcc0ab201e4c60188427cae51a5b8b | 2018-07-10 08:32:33 | 79.00 | 14.13 | 1.0 | boleto | 1.0 | 93.13 | 7f9849fcbfdf9fa3070c05b5501bf066 | 5.0 | NaN | NaN | 2018-07-10 00:00:00 | 2018-07-10 18:32:29 | construcao_ferramentas_iluminacao | 40.0 | 516.0 | 2.0 | 750.0 | 30.0 | 28.0 | 28.0 | 2077f7ec37df79c62cc24b7b8f30e8c9 | 8528 | ferraz de vasconcelos | SP | 13660.0 | porto ferreira | SP |
| 119139 | b159d0ce7cd881052da94fa165617b05 | e0c3bc5ce0836b975d6b2a8ce7bb0e3e | canceled | 2017-03-11 19:51:36 | 2017-03-11 19:51:36 | NaN | NaN | 2017-03-30 00:00:00 | 1.0 | 241a1ffc9cf969b27de6e72301020268 | 8501d82f68d23148b6d78bb7c4a42037 | 2017-03-16 19:51:36 | 19.70 | 10.96 | 1.0 | credit_card | 1.0 | 30.66 | c950324a42c5796d06f569f77d8b2e88 | 1.0 | NaN | NaN | 2017-04-01 00:00:00 | 2017-04-01 10:24:03 | automotivo | 48.0 | 260.0 | 2.0 | 400.0 | 16.0 | 4.0 | 11.0 | 78a159045124eb7601951b917a42034f | 89111 | gaspar | SC | 89031.0 | blumenau | SC |
| 119140 | 735dce2d574afe8eb87e80a3d6229c48 | d531d01affc2c55769f6b9ed410d8d3c | delivered | 2018-07-24 09:46:27 | 2018-07-24 11:24:27 | 2018-07-24 15:14:00 | 2018-08-02 22:47:35 | 2018-08-16 00:00:00 | 1.0 | 1d187e8e7a30417fda31e85679d96f0f | d263fa444c1504a75cbca5cc465f592a | 2018-07-30 11:24:27 | 399.00 | 45.07 | 1.0 | debit_card | 1.0 | 444.07 | 19f21ead7ffe5b1b5147a7877c22bae5 | 5.0 | NaN | NaN | 2018-08-03 00:00:00 | 2018-08-04 11:22:40 | moveis_decoracao | 43.0 | 729.0 | 2.0 | 2100.0 | 80.0 | 8.0 | 30.0 | 8cf3c6e1d2c8afaab2eda3fa01d4e3d2 | 60455 | fortaleza | CE | 13478.0 | americana | SP |
| 119141 | 25d2bfa43663a23586afd12f15b542e7 | 9d8c06734fde9823ace11a4b5929b5a7 | delivered | 2018-05-22 21:13:21 | 2018-05-22 21:35:40 | 2018-05-24 12:28:00 | 2018-06-12 23:11:29 | 2018-06-08 00:00:00 | 1.0 | 6e1c2008dea1929b9b6c27fa01381e90 | edf3fabebcc20f7463cc9c53da932ea8 | 2018-05-28 21:31:24 | 219.90 | 24.12 | 1.0 | credit_card | 4.0 | 244.02 | ec2817e750153dfdd61894780dfc5d9e | 4.0 | NaN | NaN | 2018-06-10 00:00:00 | 2018-06-13 09:17:47 | moveis_decoracao | 19.0 | 531.0 | 1.0 | 5900.0 | 41.0 | 21.0 | 41.0 | e55e436481078787e32349cee9febf5e | 39803 | teofilo otoni | MG | 8320.0 | sao paulo | SP |
| 119142 | 1565f22aa9452ff278638e87cc895678 | 56772dfbcbe7df908a284ff0d53adf7d | delivered | 2018-05-15 17:41:00 | 2018-05-16 03:35:29 | 2018-05-16 17:20:00 | 2018-05-21 14:31:41 | 2018-05-29 00:00:00 | 1.0 | 9c1e194db1d35a79d962ea610bfe0868 | f3862c2188522d89860c38a3ea8b550d | 2018-05-22 03:35:29 | 15.50 | 12.79 | 1.0 | boleto | 1.0 | 28.29 | cbb879403973e209b4df371a5dafbaa7 | 5.0 | NaN | NaN | 2018-06-01 00:00:00 | 2018-06-01 15:14:23 | perfumaria | 40.0 | 871.0 | 1.0 | 83.0 | 17.0 | 8.0 | 13.0 | 6ceea7c1088e15ab3c67980a2d9bb309 | 9687 | sao bernardo do campo | SP | 14092.0 | ribeirao preto | SP |